(57) Abstract: The present invention relates to nucleic acid fragments encoding amino acid sequences for flavonoid biosynthetic
enzymes in plants, and the use thereof for the modification of, for example, flavonoid biosynthesis in plants, and more specifically the
modification of the content of condensed tannins. In particularly preferred embodiments, the invention relates to the combinatorial
expression of chalcone synthase (CHS) and/or dihydroflavonol 4-reductase (BAN) and/or leucoanthocyanidine reductase (LAR) in
plants to modify, for example, flavonoid biosynthesis or more specifically the content of condensed tannins.

Chalcone Synthase, Dihydroflavonol 4-reductase and Leucoanthocyanidine reductase from Clover,
Medic, Ryegrass or Fescue.

The present invention relates to nucleic acid fragments encoding amino 
acid sequences for flavonoid biosynthetic enzyme polypeptides in plants, and the 
use thereof for the modification of, for example, flavonoid biosynthesis in plants, 
and more specifically the modification of the content of condensed tannins. In
particularly preferred embodiments, the invention relates to the combinatorial 
expression of chalcone synthase (CHS) and/or dihydroflavonol 4-reductase (BAN) 
and/or leucoanthocyanidine reductase (LAR) in plants to modify, for example, 
flavonoid biosynthesis or more specifically the content of condensed tannins. 

Flavonoids constitute a relatively diverse family of aromatic molecules that
are derived from phenylalanine and malonyl-coenzyme A (CoA, via the fatty acid 
pathway). These compounds include six major subgroups that are found in most 
higher plants: the chalcones, flavones, flavonols, flavandiols, anthocyanins and 
condensed tannins (or proanthocyanidins). A seventh group, the aurones, is 
widespread, but not ubiquitous.

Some plant species also synthesize specialised forms of flavonoids, such 
as the isoflavonoids that are found in legumes and a small number of non-legume 
plants. Similarly, sorghum, maize and gloxinia are among the few species known 
to synthesize 3-deoxyanthocyanins (or phlobaphenes in the polymerised form). 
The stilbenes, which are closely related to flavonoids, are synthesised by another
group of unrelated species that includes grape, peanut and pine. 

Besides providing pigmentation to flowers, fruits, seeds, and leaves, 
flavonoids also have key roles in signalling between plants and microbes, in male 
fertility of some species, in defence as antimicrobial agents and feeding 
deterrents, and in UV protection.

Flavonoids also have significant activities when ingested by animals, and 
there is great interest in their potential health benefits, particularly for compounds 
such as isoflavonoids, which have been linked to anticancer benefits, and 
stilbenes that are believed to contribute to reduced heart disease. Condensed
tannins, which are plant polyphenols with protein-precipitating and antioxidant
properties, are involved in protein binding, metal chelation, anti-oxidation, and
UV-light absorption. As a result, condensed tannins inhibit viruses, microorganisms,
insects, fungal pathogens, and monogastric digestion. Moderate amounts of
tannins improve forage quality by disrupting protein foam and conferring protection
from rumen pasture bloat. Bloat is a digestive disorder that occurs on some highly 
nutritious forage legumes such as alfalfa (Medicago sativa) and white clover 
(Trifolium repens). Moderate amounts of tannin can also reduce digestion rates in 
the rumen and can reduce parasitic load sufficiently to increase the titre of amino
acids and small peptides in the small intestine without compromising total 
digestion. 

The major branch pathways of flavonoid biosynthesis start with general 
phenylpropanoid metabolism and lead to the nine major subgroups: the colourless
chalcones, aurones, isoflavonoids, flavones, flavonols, flavandiols, anthocyanins,
condensed tannins, and phlobaphene pigments. The enzyme phenylalanine 
ammonia-lyase (PAL) of the general phenylpropanoid pathway will lead to the 
production of cinnamic acid. Cinnamate-4-hydroxylase (C4H) will produce
p-coumaric acid, which will be converted through the action of
4-coumaroyl:CoA-ligase (4CL) to 4-coumaroyl-CoA and malonyl-CoA. The first
committed step channelling carbon into the flavonoid biosynthesis pathway is
catalysed by chalcone synthase (CHS), which uses malonyl-CoA and
4-coumaroyl-CoA as substrates.

The Arabidopsis BANYULS gene encodes a dihydroflavonol 4-reductase-like
protein (BAN) that may be an anthocyanine reductase (ACR). The reaction
catalysed by BAN is considered to be one possible branching point from the 
general flavonoid pathway to the condensed tannin biosynthesis. 

An alternative pathway to condensed tannins is via leucoanthocyanidine 
reductase (LAR). LAR utilises the same substrate as the ACR (BAN) but produces 
a 2,3-trans isomer as compared to the 2,3-cis isomer produced by ACR.

While nucleic acid sequences encoding the key enzymes in the condensed 
tannins biosynthetic pathway CHS, BAN and LAR have been isolated for certain 
species of plants, there remains a need for materials useful in modifying flavonoid 
biosynthesis and more specifically in modifying condensed tannin biosynthesis 
and therewith in modifying forage quality, for example by disrupting protein foam
and conferring protection from rumen pasture bloat, particularly in forage legumes 
and grasses, including alfalfa, medics, clovers, ryegrasses and fescues, and for 
methods for their use. 

It is an object of the present invention to overcome, or at least alleviate, one
or more of the difficulties or deficiencies associated with the prior art. 

In one aspect, the present invention provides substantially purified or 
isolated nucleic acids or nucleic acid fragments encoding key polypeptide 
enzymes in the condensed tannins biosynthetic pathway CHS, BAN and LAR, or
functionally active fragments or variants of these enzymes, from a clover 
(Trifolium), medic (Medicago), ryegrass (Lolium) or fescue (Festuca) species. 

The present invention also provides substantially purified or isolated nucleic 
acids or nucleic acid fragments encoding amino acid sequences for a class of 
polypeptides which are related to CHS, BAN and LAR or functionally active
fragments or variants of CHS, BAN or LAR. Such polypeptides are referred to
herein as CHS-like, BAN-like and LAR-like, respectively, and include
polypeptides having similar functional activity. 

The individual or simultaneous enhancement or otherwise manipulation of 
CHS, BAN and LAR or like gene activities in plants may enhance or otherwise
alter flavonoid biosynthesis; may enhance or otherwise alter the plant capacity for 
protein binding, metal chelation, anti-oxidation, and UV-light absorption; may 
enhance or reduce or otherwise alter plant pigment production; and may enhance 
or otherwise alter the amount of condensed tannins contained within forage 
legumes and grasses, including alfalfa, medics, clovers, ryegrasses and fescues
and therewith the capacity to reduce bloating by disrupting protein foam. 

Methods for the manipulation of CHS, BAN and LAR or like gene activities 
in plants, including legumes such as clovers (Trifolium species), lucerne 
(Medicago sativa) and grass species such as ryegrasses (Lolium species) and 
fescues (Festuca species) may facilitate the production of, for example, forage
legumes and forage grasses and other crops with enhanced tolerance to biotic 
stresses such as viruses, microorganisms, insects and fungal pathogens; altered 
pigmentation in flowers; forage legumes with enhanced herbage quality and bloat- 
safety. 

The clover (Trifolium), medic (Medicago), ryegrass (Lolium) or fescue
(Festuca) species may be of any suitable type, including white clover (Trifolium 
repens), red clover (Trifolium pratense), subterranean clover (Trifolium 
subterraneum), alfalfa (Medicago sativa), Italian or annual ryegrass (Lolium
multiflorum), perennial ryegrass (Lolium perenne), tall fescue (Festuca 
arundinacea), meadow fescue (Festuca pratensis) and red fescue (Festuca rubra). 
Preferably the species is a clover or a ryegrass, more preferably white clover (T. 
repens) or perennial ryegrass (L. perenne). White clover (Trifolium repens L.) and
perennial ryegrass (Lolium perenne L.) are key pasture legumes and grasses, 
respectively, in temperate climates throughout the world. Perennial ryegrass is 
also an important turf grass. 

The nucleic acid or nucleic acid fragment may be of any suitable type and 
includes DNA (such as cDNA or genomic DNA) and RNA (such as mRNA) that is
single- or double-stranded, optionally containing synthetic, non-natural or altered 
nucleotide bases, and combinations thereof. The RNA is readily obtainable, for 
example, by transcription of a DNA sequence according to the present invention, 
to produce an RNA corresponding to the DNA sequence. The RNA may be 
synthesised in vivo, in vitro or by chemical synthesis to produce a sequence
corresponding to a DNA sequence by methods well known in the art. In this 
specification, where the degree of sequence similarity between an RNA and DNA 
is such that the strand of the DNA could encode the RNA, then the RNA is said to 
"correspond" to that DNA. 

In a preferred embodiment of this aspect of the invention, the substantially
purified or isolated nucleic acid or nucleic acid fragment encoding a CHS or
CHS-like protein includes (a) the nucleotide sequences shown in Figures 2, 6, 10 and 14
hereto (Sequence ID Nos. 1, 3, 5 and 7, respectively); (b) complements of the
sequences recited in (a); (c) sequences antisense to the sequences recited in (a)
and (b); (d) functionally active fragments and variants of the sequences
recited in (a), (b) and (c); and (e) RNA sequences corresponding to the sequences
recited in (a), (b), (c), and (d).
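
The relationships between a recited sequence, its complement, an antisense strand and a corresponding RNA can be illustrated with a minimal Python sketch. The function names and the toy sequence are hypothetical and are not taken from any Sequence ID; the sketch assumes unambiguous bases only and treats the antisense strand as the reverse complement read 5' to 3'.

```python
# Watson-Crick pairing for unambiguous DNA bases
# (assumption: no IUPAC ambiguity codes are present).
_DNA_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def complement(dna: str) -> str:
    """Base-by-base complement, written in the same left-to-right order."""
    return dna.translate(_DNA_COMPLEMENT)


def reverse_complement(dna: str) -> str:
    """Complementary strand read 5'->3' (used here as the antisense strand)."""
    return complement(dna)[::-1]


def to_rna(dna: str) -> str:
    """RNA 'corresponding' to a DNA strand: same bases, with U in place of T."""
    return dna.replace("T", "U").replace("t", "u")


toy = "ATGGCC"  # hypothetical toy sequence for illustration only
print(complement(toy))          # TACCGG
print(reverse_complement(toy))  # GGCCAT
print(to_rna(toy))              # AUGGCC
```

Applying `reverse_complement` twice returns the starting sequence, which is a quick sanity check when handling sense and antisense strands.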

In a further preferred embodiment of this aspect of the invention, the 
substantially purified or isolated nucleic acid or nucleic acid fragment encoding a 
BAN or BAN-like protein includes (a) the nucleotide sequence shown in Figure 18
hereto (Sequence ID No. 9); (b) complements of the sequence recited in (a); (c)
sequences antisense to the sequences recited in (a) and (b); (d) functionally
active fragments and variants of the sequences recited in (a), (b) and (c); and (e)
RNA sequences corresponding to the sequences recited in (a), (b), (c), and (d). 

In a still further preferred embodiment of this aspect of the invention, the 
substantially purified or isolated nucleic acid or nucleic acid fragment encoding a 
LAR or LAR-like protein includes (a) the nucleotide sequences shown in Figures 22, 26
and 30 hereto (Sequence ID Nos. 11, 13 and 15, respectively); (b) complements of
the sequences recited in (a); (c) sequences antisense to the sequences recited in
(a) and (b); (d) functionally active fragments and variants of the sequences
recited in (a), (b) and (c); and (e) RNA sequences corresponding to the sequences
recited in (a), (b), (c), and (d).

The term "isolated" means that the material is removed from its original 
environment (e.g. the natural environment if it is naturally occurring). For example, 
a naturally occurring nucleic acid or polypeptide present in a living plant is not 
isolated, but the same nucleic acid or polypeptide separated from some or all of 
the coexisting materials in the natural system, is isolated. Such nucleic acids could
be part of a vector and/or such nucleic acids could be part of a composition, and 
still be isolated in that such a vector or composition is not part of its natural 
environment. An isolated polypeptide could be part of a composition and still be 
isolated in that such a composition is not part of its natural environment. 

The term "purified" means that the nucleic acid or polypeptide is
substantially free of other nucleic acids or polypeptides. 

By "functionally active" in respect of a nucleic acid it is meant that the 
fragment or variant (such as an analogue, derivative or mutant) is capable of 
modifying flavonoid biosynthesis in a plant. Such variants include naturally
occurring allelic variants and non-naturally occurring variants. Additions, deletions,
substitutions and derivatizations of one or more of the nucleotides are 
contemplated so long as the modifications do not result in loss of functional activity 
of the fragment or variant. Preferably the functionally active fragment or variant 
has at least approximately 80% identity to the relevant part of the
above-mentioned sequence, more preferably at least approximately 90% identity, most
preferably at least approximately 95% identity. Such functionally active variants 
and fragments include, for example, those having nucleic acid changes which 
result in conservative amino acid substitutions of one or more residues in the 
corresponding amino acid sequence. Preferably the fragment has a size of at least
30 nucleotides, more preferably at least 45 nucleotides, most preferably at least 
60 nucleotides. 

By "functionally active" in respect of a polypeptide it is meant that the
fragment or variant has one or more of the biological properties or functions of the
polypeptides CHS, CHS-like, BAN, BAN-like, LAR and LAR-like, respectively.
Additions, deletions, substitutions and derivatizations of one or more of the amino 
acids are contemplated so long as the modifications do not result in loss of 
functional activity of the fragment or variant. Preferably the functionally active
fragment or variant has at least approximately 60% identity to the relevant part of
the above mentioned sequence, more preferably at least approximately 80% 
identity, most preferably at least approximately 90% identity. Such functionally 
active variants and fragments include, for example, those having conservative 
amino acid substitutions of one or more residues in the corresponding amino acid
sequence. Preferably the fragment has a size of at least 10 amino acids, more
preferably at least 15 amino acids, most preferably at least 20 amino acids. 
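
The identity and size preferences stated above for nucleic acids and polypeptides can be expressed as a small Python sketch. The specification does not prescribe an alignment method and qualifies the identity figures with "approximately", so two simplifying assumptions are made here: identity is computed over pre-aligned, ungapped sequences of equal length, and the stated figures are treated as hard cut-offs. All names and the toy sequences are hypothetical.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions between two pre-aligned, equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * matches / len(a)


def preference_tier(value, thresholds):
    """Map a value onto the 'preferably / more preferably / most preferably' tiers."""
    labels = ("preferred", "more preferred", "most preferred")
    tier = "outside the stated preferences"
    for limit, label in zip(thresholds, labels):
        if value >= limit:
            tier = label
    return tier


# Figures taken from the text; treating them as hard cut-offs is an assumption.
NUCLEIC_ACID_IDENTITY = (80.0, 90.0, 95.0)  # percent identity
NUCLEIC_ACID_LENGTH = (30, 45, 60)          # nucleotides
POLYPEPTIDE_IDENTITY = (60.0, 80.0, 90.0)   # percent identity
POLYPEPTIDE_LENGTH = (10, 15, 20)           # amino acids

reference = "ACGT" * 10              # hypothetical 40-nt reference
variant = "TCGT" * 4 + "ACGT" * 6    # same length, 4 mismatches
identity = percent_identity(reference, variant)
print(identity)                                            # 90.0
print(preference_tier(identity, NUCLEIC_ACID_IDENTITY))    # more preferred
print(preference_tier(len(variant), NUCLEIC_ACID_LENGTH))  # preferred
```

In practice the sequences would first be aligned with a dedicated alignment tool; the sketch only shows how the tiers relate to one another.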

The term "construct" as used herein refers to an artificially assembled or 
isolated nucleic acid molecule which includes the gene of interest. In general a 
construct may include the gene or genes of interest, a marker gene which in some 
cases can also be the gene of interest and appropriate regulatory sequences. It
should be appreciated that the inclusion of regulatory sequences in a construct is 
optional, for example, such sequences may not be required in situations where the 
regulatory sequences of a host cell are to be used. The term construct includes 
vectors but should not be seen as being limited thereto. 

The term "vector" as used herein encompasses both cloning and
expression vectors. Vectors are often recombinant molecules containing nucleic 
acid molecules from several sources. 

By "operatively linked" is meant that said regulatory element(s) is capable of 
causing expression of said nucleic acid(s) or nucleic acid fragment(s) in a plant 
cell and said terminator(s) is capable of terminating expression of said nucleic
acid(s) or nucleic acid fragment(s) in a plant cell. Preferably, said regulatory 
element(s) is upstream of said nucleic acid(s) or nucleic acid fragment(s) and said 
terminator(s) is downstream of said nucleic acid(s) or nucleic acid fragment(s). In 
a particularly preferred embodiment, each nucleic acid or nucleic acid fragment
has one or more upstream promoters and one or more downstream terminators, 
although expression of more than one nucleic acid or nucleic acid fragment from 
an upstream regulatory element(s) or termination of more than one nucleic acid or 
nucleic acid fragment from a downstream terminator(s) is not precluded.
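
The preferred arrangement described above (regulatory element upstream, terminator downstream, with sharing of a promoter or terminator not precluded) can be sketched as a simple order check in Python. This is illustrative only: "operatively linked" is a functional property, and the sketch tests element order alone. The `Element` class and `is_preferred_arrangement` function are hypothetical names.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Element:
    kind: str  # "promoter", "coding" or "terminator"
    name: str


def is_preferred_arrangement(cassette: List[Element]) -> bool:
    """Check element order in a cassette listed 5'->3'.

    Every coding element needs at least one promoter somewhere upstream and
    at least one terminator somewhere downstream. Several coding elements may
    share the same promoter or terminator.
    """
    has_coding = False
    for i, element in enumerate(cassette):
        if element.kind != "coding":
            continue
        has_coding = True
        upstream_promoter = any(e.kind == "promoter" for e in cassette[:i])
        downstream_terminator = any(e.kind == "terminator" for e in cassette[i + 1:])
        if not (upstream_promoter and downstream_terminator):
            return False
    return has_coding


shared = [
    Element("promoter", "CaMV 35S"),
    Element("coding", "CHS"),
    Element("coding", "BAN"),
    Element("terminator", "nos"),
]
misordered = [
    Element("coding", "CHS"),
    Element("promoter", "CaMV 35S"),
    Element("terminator", "nos"),
]
print(is_preferred_arrangement(shared))      # True
print(is_preferred_arrangement(misordered))  # False
```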

By "an effective amount" it is meant an amount sufficient to result in an 
identifiable phenotypic trait in said plant, or a plant, plant seed or other plant part 
derived therefrom. Such amounts can be readily determined by an appropriately 
skilled person, taking into account the type of plant, the route of administration and 
other relevant factors. Such a person will readily be able to determine a suitable
amount and method of administration. See, for example, Sambrook et al., 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, the entire disclosure of which is incorporated herein by reference. 

It will also be understood that the term "comprises" (or its grammatical 
variants) as used in this specification is equivalent to the term "includes" and
should not be taken as excluding the presence of other elements or features. 

Genes encoding other CHS or CHS-like, BAN or BAN-like and LAR or
LAR-like proteins, either as cDNAs or genomic DNAs, may be isolated directly by using
all or a portion of the nucleic acids or nucleic acid fragments of the present
invention as hybridisation probes to screen libraries from the desired plant
employing the methodology well known to those skilled in the art. Specific 
oligonucleotide probes based upon the nucleic acid sequences of the present 
invention may be designed and synthesized by methods known in the art. 
Moreover, the entire sequences may be used directly to synthesize DNA probes
by methods known to the skilled artisan such as random primer DNA labelling,
nick translation, or end-labelling techniques, or RNA probes using available in vitro 
transcription systems. In addition, specific primers may be designed and used to 
amplify a part or all of the sequences of the present invention. The resulting 
amplification products may be labelled directly during amplification reactions or
labelled after amplification reactions, and used as probes to isolate full-length
cDNA or genomic fragments under conditions of appropriate stringency. 

In addition, short segments of the nucleic acids or nucleic acid fragments of 
the present invention may be used in protocols to amplify longer nucleic acids or 
nucleic acid fragments encoding homologous genes from DNA or RNA. For
example, polymerase chain reaction may be performed on a library of cloned 
nucleic acid fragments wherein the sequence of one primer is derived from the 
nucleic acid sequences of the present invention, and the sequence of the other 
primer takes advantage of the presence of the polyadenylic acid tracts to the 3'
end of the mRNA precursor encoding plant genes. Alternatively, the second primer 
sequence may be based upon sequences derived from the cloning vector. For 
example, those skilled in the art can follow the RACE protocol [Frohman et al. 
(1988), Proc. Natl. Acad. Sci. USA 85:8998, the entire disclosure of which is
incorporated herein by reference] to generate cDNAs by using PCR to amplify
copies of the region between a single point in the transcript and the 3' or 5' end.
Using commercially available 3' RACE and 5' RACE systems (BRL), specific 3' or
5' cDNA fragments may be isolated [Ohara et al. (1989), Proc. Natl. Acad. Sci.
USA 86:5673; Loh et al. (1989), Science 243:217, the entire disclosures of which
are incorporated herein by reference]. Products generated by the 3' and 5' RACE
procedures may be combined to generate full-length cDNAs. 
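
The idea of pairing one primer derived from a known sequence with a second primer that exploits the polyadenylic acid tract can be illustrated with a string-level Python sketch. It is not a simulation of PCR or of any commercial RACE system: it simply locates a gene-specific primer site in a sense-strand cDNA and returns the region up to the start of the 3' poly(A) tract, where an oligo(dT) primer would anneal. The function name, the `min_tail` parameter and the toy sequence are hypothetical.

```python
def three_prime_region(cdna: str, gene_primer: str, min_tail: int = 10) -> str:
    """Return the stretch from a gene-specific primer site to the poly(A) tract.

    `cdna` is the sense strand written 5'->3' and is assumed to end in a
    poly(A) tract. Trailing A residues that belong to the gene itself cannot be
    told apart from the tract in this simplified model.
    """
    cdna = cdna.upper()
    gene_primer = gene_primer.upper()

    start = cdna.find(gene_primer)
    if start == -1:
        raise ValueError("gene-specific primer site not found")

    tail_start = len(cdna.rstrip("A"))  # index where the 3' run of A begins
    if len(cdna) - tail_start < min_tail:
        raise ValueError("no poly(A) tract of the required length at the 3' end")
    if tail_start < start + len(gene_primer):
        raise ValueError("primer site overlaps the poly(A) tract")

    return cdna[start:tail_start]


toy_cdna = "GGCATGGCCTTGACCGTT" + "A" * 12  # hypothetical transcript copy
print(three_prime_region(toy_cdna, "ATGGCC"))  # ATGGCCTTGACCGTT
```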

In a second aspect of the present invention there is provided a substantially 
purified or isolated polypeptide from a clover (Trifolium), medic (Medicago),
ryegrass (Lolium) or fescue (Festuca) species, selected from the group consisting
of CHS and CHS-like, BAN and BAN-like, and LAR and LAR-like proteins; and
functionally active fragments and variants thereof. 

The clover (Trifolium), medic (Medicago), ryegrass (Lolium) or fescue 
(Festuca) species may be of any suitable type, including white clover (Trifolium 
repens), red clover (Trifolium pratense), subterranean clover (Trifolium 
subterraneum), alfalfa (Medicago sativa), Italian or annual ryegrass (Lolium
multiflorum), perennial ryegrass (Lolium perenne), tall fescue (Festuca 
arundinacea), meadow fescue (Festuca pratensis) and red fescue (Festuca rubra). 
Preferably the species is a clover or a ryegrass, more preferably white clover (T. 
repens) or perennial ryegrass (L. perenne). 

In a preferred embodiment of this aspect of the invention, the substantially
purified or isolated CHS or CHS-like polypeptide includes an amino acid sequence
selected from the group consisting of sequences shown in Figures 3, 7, 11 and 15
hereto (Sequence ID Nos. 2, 4, 6 and 8, respectively) and functionally active
fragments and variants thereof. 

In a further preferred embodiment of this aspect of the invention, the 
substantially purified or isolated BAN or BAN-like polypeptide includes an amino 
acid sequence shown in Figure 19 hereto (Sequence ID No. 10), and functionally
active fragments and variants thereof. 

In a still further preferred embodiment of this aspect of the invention, the 
substantially purified or isolated LAR or LAR-like polypeptide includes an amino 
acid sequence selected from the group consisting of sequences shown in Figures 
23, 27 and 31 hereto (Sequence ID Nos. 12, 14 and 16, respectively), and
functionally active fragments and variants thereof. 

In a further embodiment of this aspect of the invention, there is provided a 
polypeptide produced (e.g. recombinantly) from a nucleic acid or nucleic acid 
fragment according to the present invention. Techniques for recombinantly 
producing polypeptides are well known to those skilled in the art.

Availability of the nucleotide sequences of the present invention and 
deduced amino acid sequences facilitates immunological screening of cDNA 
expression libraries. Synthetic peptides representing portions of the instant amino 
acid sequences may be synthesized. These peptides may be used to immunise 
animals to produce polyclonal or monoclonal antibodies with specificity for
peptides and/or proteins including the amino acid sequences. These antibodies 
may be then used to screen cDNA expression libraries to isolate full-length cDNA 
clones of interest. 

In a still further aspect of the present invention there is provided a construct 
including one or more nucleic acids or nucleic acid fragments according to the
present invention. 

In a particularly preferred embodiment the construct may include nucleic 
acids or nucleic acid fragments encoding both CHS or CHS-like and BAN or BAN- 
like polypeptides. 

In another preferred embodiment the construct may include nucleic acids or
nucleic acid fragments encoding both CHS or CHS-like and LAR or LAR-like 
polypeptides. 

In yet another preferred embodiment the construct may include nucleic
acids or nucleic acid fragments encoding both LAR or LAR-like and BAN or BAN- 
like polypeptides. 

In an even more preferred embodiment the construct may include nucleic 
acids or nucleic acid fragments encoding all three of CHS or CHS-like, BAN or
BAN-like and LAR or LAR-like polypeptides. 

Constructs including nucleic acids or nucleic acid fragments encoding CHS 
or CHS-like and BAN or BAN-like, and optionally further including nucleic acids or 
nucleic acid fragments encoding LAR or LAR-like, are particularly preferred. 
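
The combinatorial embodiments listed above (three pairwise combinations and the three-gene combination) can be enumerated with a short Python sketch. The function names are hypothetical; the "particularly preferred" test simply encodes the statement that constructs containing both CHS and BAN, optionally with LAR, are particularly preferred.

```python
from itertools import combinations

ENZYMES = ("CHS", "BAN", "LAR")


def combinatorial_constructs(enzymes=ENZYMES, min_size=2):
    """List every combination of at least `min_size` of the named enzymes."""
    return [
        combo
        for size in range(min_size, len(enzymes) + 1)
        for combo in combinations(enzymes, size)
    ]


def particularly_preferred(combo) -> bool:
    """True for combinations that include both CHS and BAN (LAR optional)."""
    return {"CHS", "BAN"} <= set(combo)


for combo in combinatorial_constructs():
    note = " (particularly preferred)" if particularly_preferred(combo) else ""
    print(" + ".join(combo) + note)
# CHS + BAN (particularly preferred)
# CHS + LAR
# BAN + LAR
# CHS + BAN + LAR (particularly preferred)
```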

In a still further aspect of the present invention there is provided a vector
including one or more nucleic acids or nucleic acid fragments according to the 
present invention. 

In a preferred embodiment of this aspect of the invention, the construct may 
include one or several of the following: one or more regulatory elements such as 
promoters, one or more nucleic acids or nucleic acid fragments according to the
present invention and one or more terminators; said one or more regulatory 
elements, one or more nucleic acids or nucleic acid fragments and one or more 
terminators being operatively linked. 

In a particularly preferred embodiment the construct may contain nucleic 
acids or nucleic acid fragments encoding both CHS or CHS-like and BAN or
BAN-like polypeptides, operatively linked to a regulatory element or regulatory
elements, such that both CHS or CHS-like and BAN or BAN-like polypeptides are 
expressed. 

In another preferred embodiment the construct may contain nucleic acids 
or nucleic acid fragments encoding both CHS or CHS-like and LAR or LAR-like
polypeptides, operatively linked to a regulatory element or regulatory elements, 
such that both CHS or CHS-like and LAR or LAR-like polypeptides are expressed. 

In yet another preferred embodiment the construct may contain nucleic 
acids or nucleic acid fragments encoding both LAR or LAR-like and BAN or
BAN-like polypeptides, operatively linked to a regulatory element or regulatory
elements, such that both LAR or LAR-like and BAN or BAN-like polypeptides are 
expressed. 

In an even more preferred embodiment the construct may contain nucleic
acids or nucleic acid fragments encoding all three of CHS or CHS-like, BAN or 
BAN-like and LAR or LAR-like polypeptides, operatively linked to a regulatory 
element or regulatory elements, such that all three of CHS or CHS-like, BAN or
BAN-like and LAR or LAR-like polypeptides are expressed.

Constructs including nucleic acids or nucleic acid fragments encoding CHS 
or CHS-like and BAN or BAN-like, and optionally further including nucleic acids or 
nucleic acid fragments encoding LAR or LAR-like, are particularly preferred. 

The construct or vector may be of any suitable type and may be viral or 
non-viral. The vector may be an expression vector. Such vectors include
chromosomal, non-chromosomal and synthetic nucleic acid sequences, e.g. 
derivatives of plant viruses; bacterial plasmids; derivatives of the Ti plasmid from
Agrobacterium tumefaciens; derivatives of the Ri plasmid from Agrobacterium
rhizogenes; phage DNA; yeast artificial chromosomes; bacterial artificial
chromosomes; binary bacterial artificial chromosomes; vectors derived from
combinations of plasmids and phage DNA. However, any other vector may be 
used as long as it is replicable, integrative or viable in the plant cell. 

The regulatory element and terminator may be of any suitable type and may
be endogenous to the target plant cell or may be exogenous, provided that they
are functional in the target plant cell.

Preferably the regulatory element is a promoter. A variety of promoters 
which may be employed in the vectors of the present invention are well known to 
those skilled in the art. Factors influencing the choice of promoter include the 
desired tissue specificity of the vector, and whether constitutive or inducible 
expression is desired and the nature of the plant cell to be transformed (e.g.
monocotyledon or dicotyledon). Particularly suitable promoters include but are not 
limited to the constitutive Cauliflower Mosaic Virus 35S (CaMV 35S) promoter and 
derivatives thereof, the maize Ubiquitin promoter, the rice Actin promoter, and the 
tissue-specific Arabidopsis small subunit (ASSU) promoter. 

A variety of terminators which may be employed in the vectors and
constructs of the present invention are also well known to those skilled in the art. 
The terminator may be from the same gene as the promoter sequence or a 
different gene. Particularly suitable terminators are polyadenylation signals, such
as the CaMV 35S polyA and other terminators from the nopaline synthase (nos), 
the octopine synthase (ocs) and the rbcS genes. 

The construct or vector, in addition to the regulatory element(s), the nucleic 
acid(s) or nucleic acid fragment(s) of the present invention and the terminator(s),
may include further elements necessary for expression of the nucleic acid(s) or 
nucleic acid fragment(s), in different combinations, for example vector backbone, 
origin of replication (ori), multiple cloning sites, recognition sites for recombination 
events, spacer sequences, enhancers, introns (such as the maize Ubiquitin Ubi
intron), antibiotic resistance genes and other selectable marker genes [such as the
neomycin phosphotransferase (npt2) gene, the hygromycin phosphotransferase
(hph) gene, the phosphinothricin acetyltransferase (bar or pat) gene and the
gentamycin acetyl transferase (aacC1) gene], and reporter genes [such as the
beta-glucuronidase (GUS) gene (gusA) and green fluorescent protein (gfp)]. The vector
may also contain a ribosome binding site for translation initiation. The vector may
also include appropriate sequences for amplifying expression. 

As an alternative to use of a selectable marker gene to provide a 
phenotypic trait for selection of transformed host cells, the presence of the vector 
in transformed cells may be determined by other techniques well known in the art, 
such as PCR (polymerase chain reaction), Southern blot hybridisation analysis,
histochemical GUS assays, visual examination including microscopic examination 
of fluorescence emitted by gfp, northern and Western blot hybridisation analyses. 

Those skilled in the art will appreciate that the various components of the 
construct or vector are operatively linked, so as to result in expression of said 
nucleic acid(s) or nucleic acid fragment(s). Techniques for operatively linking the
components of the vector of the present invention are well known to those skilled 
in the art. Such techniques include the use of linkers, such as synthetic linkers, for 
example including one or more restriction enzyme sites. 

The constructs and vectors of the present invention may be incorporated 
into a variety of plants, including monocotyledons (such as grasses from the
genera Lolium, Festuca, Paspalum, Pennisetum, Panicum and other forage and 
turfgrasses, corn, oat, sugarcane, wheat and barley), dicotyledons (such as 
Arabidopsis, tobacco, clovers, medics, eucalyptus, potato, sugarbeet, canola, 
soybean, chickpea) and gymnosperms. In a preferred embodiment, the vectors
may be used to transform monocotyledons, preferably grass species such as 
ryegrasses (Lolium species) and fescues (Festuca species), more preferably 
perennial ryegrass, including forage- and turf-type cultivars. In an alternate 
preferred embodiment, the constructs and vectors may be used to transform
dicotyledons, preferably forage legume species such as clovers (Trifolium species) 
and medics (Medicago species), more preferably white clover (Trifolium repens), 
red clover (Trifolium pratense), subterranean clover (Trifolium subterraneum) and 
alfalfa (Medicago sativa). Clovers, alfalfa and medics are key pasture legumes in 
temperate climates throughout the world.

Techniques for incorporating the constructs and vectors of the present 
invention into plant cells (for example by transduction, transfection or 
transformation) are known to those skilled in the art. Such techniques include 
Agrobacterium-mediated introduction, electroporation to tissues, cells and
protoplasts, protoplast fusion, injection into reproductive organs, injection into
immature embryos and high velocity projectile introduction to cells, tissues, calli, 
immature and mature embryos. The choice of technique will depend largely on the 
type of plant to be transformed. 

In a further aspect of the present invention there is provided a method of 
isogenic transformation of a dicotyledonous plant, said method including
transforming only one of each pair of cotyledons. This enables the production of 
pairs of transgenic plant and corresponding untransformed negative control in an 
otherwise isogenic genetic background for detailed functional assessment of the 
impact of the transgene on plant phenotype. In a preferred embodiment of this 
aspect of the invention, the method may include isogenic transformation of a
dicotyledonous plant with a construct or vector according to the present invention. 

Cells incorporating the constructs and vectors of the present invention may 
be selected, as described above, and then cultured in an appropriate medium to 
regenerate transformed plants, using techniques well known in the art. The culture 
conditions, such as temperature, pH and the like, will be apparent to the person
skilled in the art. The resulting plants may be reproduced, either sexually or 
asexually, using methods well known in the art, to produce successive generations 
of transformed plants.

In a further aspect of the present invention there is provided a plant cell,
plant, plant seed or other plant part, including, e.g. transformed with, one or more 
constructs, vectors, nucleic acids or nucleic acid fragments of the present 
invention. 

The plant cell, plant, plant seed or other plant part may be from any suitable
species, including monocotyledons, dicotyledons and gymnosperms. In a
preferred embodiment the plant cell, plant, plant seed or other plant part may be
from a monocotyledon, preferably a grass species, more preferably a ryegrass
(Lolium species) or fescue (Festuca species), more preferably perennial ryegrass,
including both forage- and turf-type cultivars. In an alternate preferred embodiment
the plant cell, plant, plant seed or other plant part may be from a dicotyledon,
preferably forage legume species such as clovers (Trifolium species) and medics
(Medicago species), more preferably white clover (Trifolium repens), red clover
(Trifolium pratense), subterranean clover (Trifolium subterraneum) and alfalfa
(Medicago sativa).

The present invention also provides a plant, plant seed or other plant part, 
or a plant extract derived from a plant cell of the present invention. 

The present invention also provides a plant, plant seed or other plant part, 
or a plant extract derived from a plant of the present invention. 

In a further aspect of the present invention there is provided a method of
modifying condensed tannin biosynthesis; of modifying flavonoid biosynthesis; of
modifying protein binding, metal chelation, anti-oxidation, and UV-light absorption;
of modifying plant pigment production; of modifying plant defence to biotic
stresses such as viruses, microorganisms, insects, fungal pathogens; of modifying
forage quality by disrupting protein foam and conferring protection from rumen
pasture bloat, said method including introducing into said plant an effective
amount of a nucleic acid or nucleic acid fragment, construct and/or vector
according to the present invention.

In a particularly preferred embodiment the method may include introducing
into said plant nucleic acids or nucleic acid fragments encoding both CHS or
CHS-like and BAN or BAN-like polypeptides.

In another preferred embodiment the method may include introducing into
said plant nucleic acids or nucleic acid fragments encoding both CHS or CHS-like
and LAR or LAR-like polypeptides.

In yet another preferred embodiment the method may include introducing
into said plant nucleic acids or nucleic acid fragments encoding both LAR or
LAR-like and BAN or BAN-like polypeptides.

In an even more preferred embodiment the method may include introducing
into said plant nucleic acids or nucleic acid fragments encoding all three of CHS or
CHS-like, BAN or BAN-like and LAR or LAR-like polypeptides.

Methods including the combinatorial expression of nucleic acids or nucleic
acid fragments encoding CHS or CHS-like and BAN or BAN-like, and optionally
further including the use of nucleic acids or nucleic acid fragments encoding LAR
or LAR-like, are particularly preferred.
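
By way of illustration only, the pairwise and three-way combinations of enzyme classes recited in the preceding embodiments may be enumerated as in the following minimal Python sketch. The labels are plain strings taken from the text, and the function name enzyme_combinations is hypothetical; the sketch implies nothing about which combination performs best in a plant.

```python
from itertools import combinations

# Enzyme classes named in the specification; labels are plain strings.
ENZYMES = ("CHS", "BAN", "LAR")


def enzyme_combinations(min_size=2):
    """Return every combination of at least `min_size` enzyme classes."""
    combos = []
    for size in range(min_size, len(ENZYMES) + 1):
        combos.extend(combinations(ENZYMES, size))
    return combos


if __name__ == "__main__":
    for combo in enzyme_combinations():
        print(" + ".join(combo))
```

The sketch lists the three pairs first and the three-way combination last, mirroring the order of the embodiments above.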

In a further aspect of the present invention there is provided a method of
inhibiting bloat in an animal, said method including providing the animal with a
forage plant including a construct, vector, nucleic acid or nucleic acid fragment
according to the present invention. The animal is preferably a ruminant, including
sheep, goats and cattle. The forage plant including a construct, vector, nucleic acid
or nucleic acid fragment according to the present invention may form all or part of
the feed of the animal. The forage plant preferably expresses CHS or CHS-like
proteins, BAN or BAN-like proteins, and/or LAR or LAR-like proteins at higher
levels than the equivalent wild-type plant. More preferably, the forage plant
expresses both CHS or CHS-like proteins and BAN or BAN-like proteins; both
CHS or CHS-like proteins and LAR or LAR-like proteins; or both BAN or BAN-like
proteins and LAR or LAR-like proteins; at higher levels than the equivalent
wild-type plant. More preferably, the forage plant expresses all three of CHS or
CHS-like proteins, BAN or BAN-like proteins, and LAR or LAR-like proteins; at higher
levels than the equivalent wild-type plant.

In a further aspect of the present invention there is provided a method for
enhancing an animal's growth rate, said method including providing the animal
with a forage plant including a construct, vector, nucleic acid or nucleic acid
fragment according to the present invention. The animal is preferably a ruminant,
including sheep, goats and cattle. The forage plant including a construct, vector,
nucleic acid or nucleic acid fragment according to the present invention may form
all or part of the feed of the animal. The forage plant preferably expresses CHS or
CHS-like proteins, BAN or BAN-like proteins, and/or LAR or LAR-like proteins at
higher levels than the equivalent wild-type plant. More preferably, the forage plant
expresses both CHS or CHS-like proteins and BAN or BAN-like proteins; both
CHS or CHS-like proteins and LAR or LAR-like proteins; or both BAN or BAN-like
proteins and LAR or LAR-like proteins; at higher levels than the equivalent
wild-type plant. More preferably, the forage plant expresses all three of CHS or
CHS-like proteins, BAN or BAN-like proteins, and LAR or LAR-like proteins; at higher
levels than the equivalent wild-type plant.

It is estimated that the method of enhancing an animal's growth rate
according to this invention should result in an increase in, for example, lamb
growth rate of at least approximately 5%, more preferably at least approximately
10%.

Using the methods and materials of the present invention, condensed
tannin biosynthesis, flavonoid biosynthesis, protein binding, metal chelation,
anti-oxidation, UV-light absorption, tolerance to biotic stresses such as viruses,
microorganisms, insects and fungal pathogens; pigmentation in for example
flowers and leaves; herbage quality and bloat-safety; isoflavonoid content leading
to health benefits, may be increased or otherwise altered, for example by
incorporating additional copies of one or more sense nucleic acids or nucleic acid
fragments of the present invention. They may be decreased or otherwise altered,
for example by incorporating one or more antisense nucleic acids or nucleic acid
fragments of the present invention.

Documents cited in this specification are for reference purposes only and 
their inclusion is not acknowledgment that they form part of the common general 
knowledge in the relevant art. 

The present invention will now be more fully described with reference to the 
accompanying Examples and drawings. It should be understood, however, that
the description following is illustrative only and should not be taken in any way as 
a restriction on the generality of the invention described above.

In the Figures

Figure 1 shows the plasmid map in pGEM-T Easy of TrCHSa3.

Figure 2 shows the nucleotide sequence of TrCHSa3 (Sequence ID No. 1).

Figure 3 shows the deduced amino acid sequence of TrCHSa3 (Sequence ID No. 2).

Figure 4 shows plasmid maps of sense and antisense constructs of TrCHSa3 in
the binary vector pPZP221:35S2.

Figure 5 shows the plasmid map in pGEM-T Easy of TrCHSc.

Figure 6 shows the nucleotide sequence of TrCHSc (Sequence ID No. 3).

Figure 7 shows the deduced amino acid sequence of TrCHSc (Sequence ID No. 4).

Figure 8 shows plasmid maps of sense and antisense constructs of TrCHSc in the
binary vector pPZP221:35S2.

Figure 9 shows the plasmid map in pGEM-T Easy of TrCHSf.

Figure 10 shows the nucleotide sequence of TrCHSf (Sequence ID No. 5).

Figure 11 shows the deduced amino acid sequence of TrCHSf (Sequence ID No. 6).

Figure 12 shows plasmid maps of sense and antisense constructs of TrCHSf in
the binary vector pPZP221:35S2.

Figure 13 shows the plasmid map in pGEM-T Easy of TrCHSh.

Figure 14 shows the nucleotide sequence of TrCHSh (Sequence ID No. 7).

Figure 15 shows the deduced amino acid sequence of TrCHSh (Sequence ID No. 8).

Figure 16 shows plasmid maps of sense and antisense constructs of TrCHSh in
the binary vector pPZP221:35S2.

Figure 17 shows the plasmid map in pGEM-T Easy of TrBANa.

Figure 18 shows the nucleotide sequence of TrBANa (Sequence ID No. 9).

Figure 19 shows the deduced amino acid sequence of TrBANa (Sequence ID No. 10).

Figure 20 shows plasmid maps of sense and antisense constructs of TrBANa in the
binary vector pPZP221:35S2.

Figure 21 shows the plasmid map in pGEM-T Easy of TrLARa.

Figure 22 shows the nucleotide sequence of TrLARa (Sequence ID No. 11).

Figure 23 shows the deduced amino acid sequence of TrLARa (Sequence ID No. 12).

Figure 24 shows plasmid maps of sense and antisense constructs of TrLARa in
the binary vector pPZP221:35S2.

Figure 25 shows the plasmid map in pGEM-T Easy of TrLARb.

Figure 26 shows the nucleotide sequence of TrLARb (Sequence ID No. 13).

Figure 27 shows the deduced amino acid sequence of TrLARb (Sequence ID No. 14).

Figure 28 shows plasmid maps of sense and antisense constructs of TrLARb in
the binary vector pPZP221:35S2.

Figure 29 shows the plasmid map in pGEM-T Easy of TrLARc.

Figure 30 shows the nucleotide sequence of TrLARc (Sequence ID No. 15).

Figure 31 shows the deduced amino acid sequence of TrLARc (Sequence ID No. 16).

Figure 32 shows plasmid maps of sense and antisense constructs of TrLARc in
the binary vector pPZP221:35S2.

Figure 33 shows the plasmid map of the binary vector
pPZP221:ASSU::TrBAN:35S2::TrCHS.

Figure 34 shows the plasmid maps of the modular vector system comprising a
binary base vector and 7 auxiliary vectors.

Figure 35 shows an example of the modular binary transformation vector system
comprising plasmid maps of the binary transformation vector backbone and 4
expression cassettes in auxiliary vectors (A) and the plasmid map of the T-DNA
region of the final binary transformation vector.

Figure 36 shows A, white clover cotyledons; B, C, D, selection of plantlets
transformed with a binary transformation vector constructed as described in
Examples 4 and 5; E, putative transgenic white clover on root-inducing medium; F,
G, white clover plants transgenic for genes involved in condensed tannin
biosynthesis.

Figure 37 shows the molecular analysis of white clover plants transgenic for the
TrBAN gene with Q-PCR amplification plot, agarose gel of PCR product and
Southern hybridisation blot.

Figure 38 shows the molecular analysis of white clover plants transgenic for the
TrCHSf gene with Q-PCR amplification plot and agarose gel of PCR product.

Figure 39 shows the molecular analysis of white clover plants transgenic for the
TrLARb gene with Q-PCR amplification plot, agarose gel of PCR product and
Southern hybridisation blot.

EXAMPLE 1 

Preparation of cDNA libraries, isolation and sequencing of cDNAs coding for
CHS, CHS-like, BAN, BAN-like, LAR and LAR-like proteins from white clover
(Trifolium repens)

cDNA libraries representing mRNAs from various organs and tissues of
white clover (Trifolium repens) were prepared. The characteristics of the white
clover libraries are described below (Table 1).

TABLE 1

cDNA libraries from white clover (Trifolium repens)

Library   Organ/Tissue
01wc      Whole seedling, light grown
02wc      Nodulated root 3, 5, 10, 14, 21 & 28 day old seedling
03wc      Nodules pinched off roots of 42 day old rhizobium inoculated plants
04wc      Cut leaf and stem collected after 0, 1, 4, 6 & 14 h after cutting
05wc      Inflorescences: <50% open, not fully open and fully open
06wc      Dark grown etiolated
07wc      Inflorescence - very early stages, stem elongation, < 15 petals, 15-20 petals
08wc      Seed frozen at -80°C, imbibed in dark overnight at 10°C
09wc      Drought stressed plants
10wc      AMV infected leaf
11wc      WCMV infected leaf
12wc      Phosphorus starved plants
13wc      Vegetative stolon tip
14wc      Stolon root initials
15wc      Senescing stolon
16wc      Senescing leaf

The cDNA libraries may be prepared by any of many methods available.
For example, total RNA may be isolated using the Trizol method (Gibco-BRL,
USA) or the RNeasy Plant Mini kit (Qiagen, Germany), following the
manufacturers' instructions. cDNAs may be generated using the SMART PCR
cDNA synthesis kit (Clontech, USA), amplified by long distance polymerase
chain reaction using the Advantage 2 PCR Enzyme system (Clontech, USA),
cleaned using the GeneClean spin column (Bio 101, USA), then tailed and size
fractionated, according to the protocol provided by Clontech. The
cDNAs may be introduced into the pGEM-T Easy Vector System I (Promega,
USA) according to the protocol provided by Promega. The cDNAs in the pGEM-T
Easy plasmid vector are transfected into Escherichia coli Epicurean coli
XL10-Gold ultra competent cells (Stratagene, USA) according to the protocol
provided by Stratagene.

Alternatively, the cDNAs may be introduced into plasmid vectors by first
preparing the cDNA libraries in Uni-ZAP XR vectors according to the
manufacturer's protocol (Stratagene Cloning Systems, La Jolla, CA, USA). The
Uni-ZAP XR libraries are converted into plasmid libraries according to the protocol 
provided by Stratagene. Upon conversion, cDNA inserts will be contained in the 
plasmid vector pBlueScript. In addition, the cDNAs may be introduced directly into 
precut pBlueScript II SK(+) vectors (Stratagene) using T4 DNA ligase (New
England Biolabs), followed by transfection into E. coli DH10B cells according to 
the manufacturer's protocol (GIBCO BRL Products). 

Once the cDNA inserts are in plasmid vectors, plasmid DNAs are prepared
from randomly picked bacterial colonies containing recombinant plasmids, or the
insert cDNA sequences are amplified via polymerase chain reaction using primers
specific for vector sequences flanking the inserted cDNA sequences. Plasmid
DNA preparation may be performed robotically using the Qiagen QiaPrep Turbo
kit (Qiagen, Germany) according to the protocol provided by Qiagen. Amplified
insert DNAs are sequenced in dye-terminator sequencing reactions to generate
partial cDNA sequences (expressed sequence tags or "ESTs"). The resulting
ESTs are analysed using an Applied Biosystems ABI 3700 sequence analyser.

EXAMPLE 2 
DNA sequence analyses 

The cDNA clones encoding CHS, CHS-like, BAN, BAN-like, LAR and
LAR-like proteins were identified by conducting BLAST (Basic Local Alignment Search
Tool; Altschul et al. (1990), J. Mol. Biol. 215:403-410) searches. The cDNA
sequences obtained were analysed for similarity to all publicly available DNA
sequences contained in the eBioinformatics nucleotide database using the
BLASTn algorithm provided by the National Center for Biotechnology Information
(NCBI). The DNA sequences were translated in all reading frames and compared
for similarity to all publicly available protein sequences contained in the
SWISS-PROT protein sequence database using the BLASTx algorithm (v 2.0.1) (Gish and
States (1993), Nature Genetics 3:266-272) provided by the NCBI.
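
For illustration, translation of a DNA sequence in all six reading frames, which BLASTx performs internally before the protein comparison, may be sketched in Python as follows. This is a minimal sketch and not the NCBI implementation: it assumes the standard genetic code and an unambiguous A/C/G/T alphabet, and the function names are hypothetical. The example fragment is the 3' portion of the TrCHSa3 forward primer listed in Table 3.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons from position 0; unknown codons become 'X'."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translations(seq):
    """Return the translations keyed '+1'..'+3' (forward) and '-1'..'-3' (reverse)."""
    frames = {}
    for strand, template in (("+", seq.upper()), ("-", reverse_complement(seq))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(template[offset:])
    return frames


if __name__ == "__main__":
    fragment = "ATGGTGAGTGTAGCTG"  # 3' portion of primer 05wc1RsB06f (Table 3)
    for frame, protein in six_frame_translations(fragment).items():
        print(frame, protein)
```

Each of the six translations would then be compared against the protein database; that comparison step is outside the scope of this sketch.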

The cDNA sequences obtained and identified were then used to identify,
with the BLASTN algorithm, additional identical and/or overlapping cDNA
sequences among those generated. The identical and/or overlapping sequences
were subjected to a multiple alignment using the CLUSTALw algorithm, and a
consensus contig sequence was derived from this multiple sequence alignment. The
consensus contig sequence was then used as a query for a search against the
SWISS-PROT protein sequence database using the BLASTx algorithm to confirm
the initial identification.
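
The derivation of a consensus sequence from a multiple alignment may be illustrated by the following minimal Python sketch. It assumes the sequences have already been aligned to equal length (for example by CLUSTALW) with '-' as the gap character, and it uses a simple per-column majority vote, which is a stand-in for, not a reproduction of, the consensus method of any particular program. The function name and the toy alignment are hypothetical.

```python
from collections import Counter


def consensus(aligned, gap="-"):
    """Majority-vote consensus of equal-length, pre-aligned sequences.

    Gap characters are ignored when voting; a column consisting only of
    gaps yields a gap in the consensus.
    """
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    result = []
    for column in zip(*aligned):
        counts = Counter(base for base in column if base != gap)
        result.append(counts.most_common(1)[0][0] if counts else gap)
    return "".join(result)


if __name__ == "__main__":
    # Toy alignment, not data from the specification.
    print(consensus(["ATGC-A", "ATGCGA", "ATCCGA"]))
```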

EXAMPLE 3 

Identification and full-length sequencing of cDNAs encoding white clover
CHS, BAN and LAR proteins

To characterise them fully for the purposes of generating probes for
hybridisation experiments and generating transformation vectors, a set of
cDNAs encoding white clover CHS, BAN and LAR proteins was identified and fully
sequenced.

Full-length cDNAs were identified from our EST sequence database using
relevant published sequences (NCBI databank) as queries for BLAST searches.
Full-length cDNAs were identified by alignment of the query and hit sequences
using Sequencher (Gene Codes Corp., Ann Arbor, MI 48108, USA). The original
plasmid was then used to transform chemically competent XL-1 cells (prepared
in-house, CaCl2 protocol). After colony PCR (using HotStarTaq, Qiagen) a minimum
of three PCR-positive colonies per transformation were picked for initial
sequencing with M13F and M13R primers. The resulting sequences were aligned
with the original EST sequence using Sequencher to confirm identity and one of
the three clones was picked for full-length sequencing, usually the one with the
best initial sequencing result.

Sequencing of TrBAN could be completed with M13F and M13R primers.
Sequencing of TrCHSa3, TrCHSc, TrCHSf, TrCHSh, TrLARa, TrLARb and
TrLARc was completed by primer walking, i.e. oligonucleotide primers were
designed to the initial sequence and used for further sequencing. The sequences
of the oligonucleotide primers are shown in Table 2.

Contigs were then assembled in Sequencher. The contigs include the
sequences of the SMART primers used to generate the initial cDNA library as well
as pGEM-T Easy vector sequence up to the EcoRI cut site both at the 5' and 3'
end.

Plasmid maps and the full cDNA sequences of TrCHSa3, TrCHSc, TrCHSf,
TrCHSh, TrBANa, TrLARa, TrLARb and TrLARc proteins were obtained (Figures
1, 2, 5, 6, 9, 10, 13, 14, 17, 18, 21, 22, 25, 26, 29 and 30).



TABLE 2

List of primers used for sequencing of the full-length cDNAs of TrCHSa3,
TrCHSc, TrCHSf, TrCHSh, TrLARa, TrLARb and TrLARc

gene name   clone ID      sequencing primer   primer sequence (5'->3')
TrCHSa3     05wc1RsB06    05wc1RsB06.f1       AGGAGGCTGCAGTCAAGG
                          05wc1RsB06.f2       TGCCTGAAATTGAGAAACC
                          05wc1RsB06.f3       AAAGCTAGCCTTGAAGCC
TrCHSc      07wc1TsE12    07wc1TsE12.f1       TCGGACATAACTCATGTGG
                          07wc1TsE12.f2       [sequence illegible in source]
                          07wc1TsE12.r1       TGGACATTTATTGGTTGC
                          07wc1TsE12.r2       TATCATGTCTGGAAATGC
TrCHSf      07wc1UsD07    07wc1UsD07.f1       AGATTGCATCAAAGAATGG
                          07wc1UsD07.r1       GGTCCAAAAGCCAATCC
TrCHSh      13wc2IsG04    13wc2IsG04.f1       TAAGACGAGACATAGTGG
                          13wc2IsG04.r1       TATTCACTAAGCACATGC
TrLARa      05wc1CsA02    05wc1CsA02.f1       TCATTTCTGCAATAGGAGG
                          05wc1CsA02.r1       ATCCACCTCAGGTGAACC
TrLARb      05wc3EsA03    05wc3EsA03.f1       AATAGGAGGCTCTGATGG
                          05wc3EsA03.r1       ATCCACCTCAGGTGAACC
TrLARc      07wc1VsF06    07wc1VsF06.f1       AGGCTCTGATGGCTTGC
                          07wc1VsF06.r1       ATCCACCTCAGGTGAACC

EXAMPLE 4

Development of binary transformation vectors containing chimeric genes
with cDNA sequences from white clover TrCHSa3, TrCHSc, TrCHSf, TrCHSh,
TrBANa, TrLARa, TrLARb and TrLARc

To alter the expression of the proteins involved in flavonoid biosynthesis,
and more specifically condensed tannin biosynthesis to improve herbage quality
and bloat-safety, a set of sense and antisense binary transformation vectors was
produced.

cDNA fragments were generated by high fidelity PCR with a proofreading
DNA polymerase using the original pGEM-T Easy plasmid cDNA as a template.
The primers used (Table 3) contained recognition sites for appropriate restriction
enzymes, for example EcoRI and XbaI, for directional and non-directional cloning
into the target vector. After PCR amplification and restriction digest with the
appropriate restriction enzyme (usually XbaI), the cDNA fragments were cloned
into the corresponding site in a modified pPZP binary vector (Hajdukiewicz et al.,
1994). The pPZP221 vector was modified to contain the 35S2 cassette from
pKYLX71:35S2 (Schardl et al., 1987) as follows: pKYLX71:35S2 was cut with ClaI.
The 5' overhang was filled in using Klenow and the blunt end was A-tailed with
Taq polymerase. After cutting with EcoRI, the 2 kb fragment with an
EcoRI-compatible end and a 3'-A tail was gel-purified. pPZP221 was cut with HindIII
and the resulting 5' overhang filled in and T-tailed with Taq polymerase. The
remainder of the original pPZP221 multi-cloning site was removed by digestion with
EcoRI, and the expression cassette cloned into the EcoRI site and the 3' T overhang,
restoring the HindIII site. This binary vector contains between the left and right
border the plant selectable marker gene aacC1 under the control of the 35S promoter
and 35S terminator and the pKYLX71:35S2-derived expression cassette with a CaMV
35S promoter with a duplicated enhancer region and an rbcS terminator.
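
The presence of the EcoRI and XbaI recognition sites in a cloning primer may be checked with a short script. The following Python sketch is illustrative only: the recognition sequences GAATTC (EcoRI) and TCTAGA (XbaI) are the standard ones, the two primers are taken from Table 3, and the function name find_sites is hypothetical.

```python
# Recognition sequences of the two enzymes named in the text.
SITES = {"EcoRI": "GAATTC", "XbaI": "TCTAGA"}


def find_sites(primer):
    """Return {enzyme: 0-based position of first occurrence} for sites present."""
    seq = primer.upper().replace(" ", "")
    hits = {}
    for name, site in SITES.items():
        position = seq.find(site)
        if position != -1:
            hits[name] = position
    return hits


if __name__ == "__main__":
    # Primers 05wc1RsB06f (TrCHSa3) and 05wc2XsG02f (TrBANa) from Table 3.
    print(find_sites("GAATTCTAGAAGATATGGTGAGTGTAGCTG"))
    print(find_sites("GGATCCTCTAGAGCACTAGTGTGTATAAGTTTCTTGG"))
```

In the first primer the two sites overlap (GAATTCTAGA), so a single ten-base 5' extension carries both.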

Alternatively, the primers for the amplification of cDNA fragments contained
attB sequences for use with recombinases utilising the GATEWAY® system
(Invitrogen). The resulting PCR fragments were used in a recombination reaction
with pDONR® vector (Invitrogen) to generate entry vectors. A GATEWAY® cloning
cassette (Invitrogen) was introduced into the multicloning site of the
pPZP221:35S2 vector following the manufacturer's protocol. In a further
recombination reaction, the cDNAs encoding the open reading frame sequences
were transferred from the entry vector to the GATEWAY®-enabled pPZP221:35S2
vector.
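
The construction of attB-tailed primers may be sketched as follows. The two 29-base extensions used here are simply the prefixes shared by the GATEWAY-compatible forward and reverse primers in Table 3 and are treated as the attB1 and attB2 adapters on that basis; the function name and the choice of the TrCHSa3 primer cores are for illustration only.

```python
# 5' extensions shared by the "fG" and "rG" primers of Table 3 (assumed to be
# the attB1 and attB2 adapter sequences).
ATTB1 = "GGGGACAAGTTTGTACAAAAAAGCAGGCT"
ATTB2 = "GGGGACCACTTTGTACAAGAAAGCTGGGT"


def gateway_primers(forward_core, reverse_core):
    """Prepend the attB1/attB2 extensions to gene-specific primer cores."""
    return ATTB1 + forward_core.upper(), ATTB2 + reverse_core.upper()


if __name__ == "__main__":
    # Cores: the XbaI site plus the gene-specific part of the TrCHSa3 primers.
    forward, reverse = gateway_primers(
        "TCTAGAAGATATGGTGAGTGTAGCTG",
        "TCTAGAATCACACATCTTATATAGCC",
    )
    print(forward)
    print(reverse)
```

With these inputs the sketch reproduces primers 05wc1RsB06fG and 05wc1RsB06rG as listed in Table 3.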

The orientation of the constructs (sense or antisense) was checked by
restriction enzyme digest and sequencing which also confirmed the correctness of
the sequence. Transformation vectors containing chimeric genes using full-length
open reading frame cDNAs encoding white clover TrCHSa3, TrCHSc, TrCHSf,
TrCHSh, TrBANa, TrLARa, TrLARb and TrLARc proteins in sense and antisense
orientation under the control of the CaMV 35S2 promoter were generated (Figures
4, 8, 12, 16, 20, 24, 28 and 32).
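
When sequence data are available, the orientation of an insert can also be checked computationally. The following Python sketch is an illustrative assumption about how such a check might be scripted and is not the procedure used in this Example: it takes a read running in the promoter-to-terminator direction and reports whether a reference open reading frame fragment occurs as given (sense) or as its reverse complement (antisense). The function names and the toy read are hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def insert_orientation(read, reference):
    """Classify a promoter-to-terminator read relative to a reference fragment."""
    read, reference = read.upper(), reference.upper()
    if reference in read:
        return "sense"
    if reverse_complement(reference) in read:
        return "antisense"
    return "unknown"


if __name__ == "__main__":
    reference = "ATGGTGAGTGTAGCTG"             # fragment from primer 05wc1RsB06f
    sense_read = "TCTAGA" + reference + "TCTAGA"  # toy read flanked by XbaI sites
    print(insert_orientation(sense_read, reference))
    print(insert_orientation(reverse_complement(sense_read), reference))
```

Because XbaI cloning is non-directional, both outcomes are expected among the recovered clones, which is why each construct needs to be checked.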

TABLE 3

List of primers used to PCR-amplify the open reading frames

gene name   primer          primer sequence (5'->3')
TrCHSa3     05wc1RsB06f     GAATTCTAGAAGATATGGTGAGTGTAGCTG
            05wc1RsB06r     GAATTCTAGAATCACACATCTTATATAGCC
TrCHSa3     05wc1RsB06fG    GGGGACAAGTTTGTACAAAAAAGCAGGCTTCTAGAAGATATGGTGAGTGTAGCTG
            05wc1RsB06rG    GGGGACCACTTTGTACAAGAAAGCTGGGTTCTAGAATCACACATCTTATATAGCC
TrCHSc      07wc1TsE12f     GAATTCTAGAAGAAGAAATATGGGAGACGAAGG
            07wc1TsE12r     GAATTCTAGAAAGACTTCATGCACACAAGTTCC
TrCHSf      07wc1UsD07f     GAATTCTAGATGATTCATTGTTTGTTTCCATAAC
            07wc1UsD07r     GAATTCTAGAACATATTCATCTTCCTATCAC [primer name and first bases partly illegible in source]
TrCHSh      13wc2IsG04f     GAATTCTAGATCCAAATTCTCGTACCTCACC
            13wc2IsG04r     GAATTCTAGATAGTTCACATCTCTCGGCAGG
TrBANa      05wc2XsG02f     GGATCCTCTAGAGCACTAGTGTGTATAAGTTTCTTGG
            05wc2XsG02r     GGATCCTCTAGACCCCCTTAGTCTTAAAATACTCG
TrLARa      05wc1CsA02fG    GGGGACAAGTTTGTACAAAAAAGCAGGCTCTAGAAAGCAAAGCAATGGCACC
            05wc1CsA02rG    GGGGACCACTTTGTACAAGAAAGCTGGGTCTAGATCCACCTCAGGTGAACC
TrLARb      05wc3EsA03fG    GGGGACAAGTTTGTACAAAAAAGCAGGCTCTAGAAAGCAATGGCACCAGCAGC
            05wc3EsA03rG    GGGGACCACTTTGTACAAGAAAGCTGGGTCTAGATCCACCTCAGGTGAACC
TrLARc      07wc1VsF06fG    GGGGACAAGTTTGTACAAAAAAGCAGGCTCTAGATAAAGCAATGGCACCAGC
            07wc1VsF06rG    GGGGACCACTTTGTACAAGAAAGCTGGGTCTAGATCCACCTCAGGTGAACC

The pPZP221:35S2 binary vector was further modified to contain two
expression cassettes within one T-DNA. The expression cassette from the binary
vector pWM5 consisting of the ASSU promoter and the tob terminator was
PCR-amplified with a proofreading DNA polymerase using oligonucleotide primers with
the following sequences:

forward primer 5'-ccaccatgtttgaaatttattatgtgtttttttccg-3';

reverse primer 5'-taatcccgggtaagggcagcccatacaaatgaagc-3'.

The PCR product was cut with BstXI and SmaI and cloned directionally into
the equally cut pPZP221:35S2 vector. Additionally, a GATEWAY® cloning cassette
(Invitrogen) was introduced into the multicloning site in the 35S2:rbcS expression
cassette following the manufacturer's protocol. TrBANa was cloned into the
ASSU:tob expression cassette, and TrCHSa3 was amplified with the
GATEWAY®-compatible primers (see Table 3) and cloned into the 35S2:rbcS expression
cassette. A transformation vector containing chimeric genes using full-length open
reading frame cDNAs encoding white clover TrBANa protein in sense orientation
under the control of the ASSU promoter and TrCHSa3 protein in sense orientation
under the control of the CaMV 35S2 promoter within the same T-DNA was
generated (Figure 33).

EXAMPLE 5

Development of binary transformation vectors containing chimeric genes
with a combination of 2 or more cDNA sequences from white clover
TrCHSa3, TrCHSc, TrCHSf, TrCHSh, TrBANa, TrLARa, TrLARb and TrLARc

To alter the expression of the proteins involved in flavonoid biosynthesis,
and more specifically condensed tannin biosynthesis to improve herbage quality
and bloat-safety, a modular binary transformation vector system was used (Figure
34). The modular binary vector system enables simultaneous integration into the
plant genome of up to seven transgenes, the expression of each of which is
controlled by individual promoter and terminator sequences (Goderis et al., 2002).

The modular binary vector system consists of a pPZP200-derived vector
(Hajdukiewicz et al., 1994) backbone containing within the T-DNA a number of
unique restriction sites recognised by homing endonucleases. The same
restriction sites are present in pUC18-based auxiliary vectors flanking standard
multicloning sites. Expression cassettes comprising a selectable marker gene
sequence or a cDNA sequence to be introduced into the plant under the control of
regulatory sequences like promoter and terminator can be constructed in the
auxiliary vectors and then transferred to the binary vector backbone utilising the
homing endonuclease restriction sites. Up to seven expression cassettes can thus
be integrated into a single binary transformation vector. The system is highly
flexible and allows for any combination of cDNA sequence to be introduced into
the plant with any regulatory sequence.
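
The modular arrangement described above may be pictured as a simple data structure. The following Python sketch is purely illustrative: the class and method names are hypothetical, the only constraint modelled is the limit of seven expression cassettes stated in the text, and the individual homing endonuclease sites are not modelled. The example cassettes combine regulatory elements and coding sequences named elsewhere in this specification.

```python
from dataclasses import dataclass, field
from typing import List

MAX_CASSETTES = 7  # upper limit stated in the text


@dataclass(frozen=True)
class ExpressionCassette:
    promoter: str
    coding_sequence: str
    terminator: str

    def label(self) -> str:
        return f"{self.promoter}::{self.coding_sequence}-{self.terminator}"


@dataclass
class ModularBinaryVector:
    backbone: str
    cassettes: List[ExpressionCassette] = field(default_factory=list)

    def add_cassette(self, cassette: ExpressionCassette) -> None:
        """Add a cassette, refusing to exceed the seven-cassette limit."""
        if len(self.cassettes) >= MAX_CASSETTES:
            raise ValueError(f"at most {MAX_CASSETTES} cassettes are supported")
        self.cassettes.append(cassette)

    def t_dna(self) -> str:
        """Return a one-line summary of the cassettes in insertion order."""
        return " | ".join(c.label() for c in self.cassettes)


vector = ModularBinaryVector(backbone="pPZP200-derived")
vector.add_cassette(ExpressionCassette("nos", "nptII", "nos"))
vector.add_cassette(ExpressionCassette("ASSU", "TrBANa", "tob"))
vector.add_cassette(ExpressionCassette("35S2", "TrCHSa3", "rbcS"))

if __name__ == "__main__":
    print(vector.t_dna())
```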

For example, a selectable marker cassette comprising the nos promoter
and nos terminator regulatory sequences controlling the expression of the nptII
gene was PCR-amplified using a proofreading DNA polymerase from the binary
vector pKYLX71:35S2 and directionally cloned into the AgeI and NotI sites of the
auxiliary vector pAUX3166. Equally, other selectable marker cassettes can be
introduced into any of the auxiliary vectors.

In another example, the expression cassette from the binary vector pWM5
consisting of the ASSU promoter and the tob terminator was PCR-amplified with a
proofreading DNA polymerase and directionally cloned into the AgeI and NotI sites
of the auxiliary vector pAUX3169. Equally, other expression cassettes can be
introduced into any of the auxiliary vectors.

In yet another example, the expression cassette from the direct gene 
transfer vector pDH51 was cut using EcoRI and cloned directly into the EcoRI site 
of the auxiliary vector pAUX3132.

TABLE 4

List of primers used to PCR-amplify plant selectable marker cassettes and
the regulatory elements used to control the expression of TrCHSa3, TrCHSc,
TrCHSf, TrCHSh, TrBANa, TrLARa, TrLARb and TrLARc genes

expression cassette   primer    primer sequence (5'->3')
nos::nptII-nos        forward   ATAATAACCGGTTGATCATGAGCGGAGAATTAAGGG
                      reverse   ATAATAGCGGCCGCTAGTAACATAGATGACACCGCG
35S::aacC1-35S        forward   [sequence illegible in source]
                      reverse   [sequence illegible in source]
35S2::rbcS            forward   ATAATAACCGGTGCCCGGGGATCTCCTTTGCC
                      reverse   ATAATAGCGGCCGCATGCATGTTGTCAATCAATTGG
assu::tob             forward   TAATACCGGTAAATTTATTATGRGTTTTTTTCCG
                      reverse   TAATGCGGCCGCTAAGGGCAGCCCATACAAATGAAGC

The expression cassettes were further modified by introducing a
GATEWAY® cloning cassette (Invitrogen) into the multicloning site of the
respective pAUX vector following the manufacturer's protocol. In a recombination
reaction, the cDNAs encoding the open reading frame sequences were transferred
from the entry vector obtained as described in Example 4 to the
GATEWAY®-enabled pAUX vector. Any combination of the regulatory elements with cDNA
sequences of TrCHSa3, TrCHSc, TrCHSf, TrCHSh, TrBANa, TrLARa, TrLARb
and TrLARc can be obtained. One typical example is given in Figure 35 with
expression cassettes for the nptII plant selectable marker, TrBANa, TrLARa and
TrCHSa3.

Complete expression cassettes comprising any combination of regulatory 
elements and cDNA sequences to be introduced into the plant were then cut from 
the auxiliary vectors using the respective homing endonuclease and cloned into 
the respective restriction site on the binary vector backbone. After verification of
the construct by nucleotide sequencing, the binary transformation vector 
comprising a number of expression cassettes was used to generate transgenic 
white clover plants. 

EXAMPLE 6

Production by Agrobacterium-mediated transformation and analysis of
transgenic white clover plants carrying chimeric white clover TrCHSa3,
TrCHSc, TrCHSf, TrCHSh, TrBANa, TrLARa, TrLARb and TrLARc genes
involved in flavonoid biosynthesis

A set of binary transformation vectors carrying chimeric white clover genes
involved in flavonoid biosynthesis, and more specifically condensed tannin
biosynthesis to improve herbage quality and bloat-safety, was produced as
detailed in Examples 4 and 5.

Agrobacterium-mediated gene transfer experiments were performed using
these transformation vectors.

The production of transgenic white clover plants carrying the white clover
TrCHSa3, TrCHSc, TrCHSf, TrCHSh, TrBANa, TrLARa, TrLARb and TrLARc
cDNAs, either singly or in combination, is described here in detail.

Preparation of Agrobacterium 

Agrobacterium tumefaciens strain AGL-1 transformed with one of the binary
vector constructs detailed in Example 6 was streaked on LB medium containing
50 µg/ml rifampicin and 50 µg/ml kanamycin and grown at 27 °C for 48 hours. A
single colony was used to inoculate 5 ml of LB medium containing 50 µg/ml
rifampicin and 50 µg/ml kanamycin and grown overnight at 27 °C and 250 rpm on
an orbital shaker. The overnight culture was used as an inoculum for 500 ml of LB
medium containing 50 µg/ml kanamycin only. Incubation was overnight at 27 °C
and 250 rpm on an orbital shaker in a 2 l Erlenmeyer flask.
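
The antibiotic additions above follow from a simple C1·V1 = C2·V2 calculation. The sketch below is illustrative only: the stock concentrations (50 mg/ml kanamycin, 25 mg/ml rifampicin) and the function name are assumptions, since the text states only the final concentrations and culture volumes.

```python
def stock_volume_ml(final_ug_per_ml: float, culture_ml: float,
                    stock_mg_per_ml: float) -> float:
    """Volume of stock solution (ml) needed to reach the final concentration.

    Uses C1 * V1 = C2 * V2 and ignores the small volume added by the stock.
    """
    stock_ug_per_ml = stock_mg_per_ml * 1000.0
    return final_ug_per_ml * culture_ml / stock_ug_per_ml


# Hypothetical stock concentrations; not stated in the text.
KANAMYCIN_STOCK_MG_PER_ML = 50.0
RIFAMPICIN_STOCK_MG_PER_ML = 25.0

# 5 ml starter culture: 50 ug/ml rifampicin and 50 ug/ml kanamycin.
starter_kan_ml = stock_volume_ml(50, 5, KANAMYCIN_STOCK_MG_PER_ML)
starter_rif_ml = stock_volume_ml(50, 5, RIFAMPICIN_STOCK_MG_PER_ML)

# 500 ml main culture: 50 ug/ml kanamycin only.
main_kan_ml = stock_volume_ml(50, 500, KANAMYCIN_STOCK_MG_PER_ML)

print(f"starter: {starter_kan_ml * 1000:.1f} ul kanamycin, "
      f"{starter_rif_ml * 1000:.1f} ul rifampicin")
print(f"main culture: {main_kan_ml:.2f} ml kanamycin")
```

With other stock concentrations only the two constants need to change.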

Preparation of white clover seeds 

One spoon of seeds (ca. 500) was placed into a 280 µm mesh size sieve and
washed for 5 min under running tap water, taking care not to wash seeds out of
the sieve. In a laminar flow hood, seeds were transferred with the spoon into an
autoclaved 100 ml plastic culture vessel. A magnetic stirrer (wiped with 70%
EtOH) and ca. 30 ml 70% EtOH were added, and the seeds were stirred for 5 min.
The EtOH was discarded and replaced by 50 ml 1.5% sodium hypochlorite. The
seeds were stirred for an additional 45 - 60 min on a magnetic stirrer. The sodium
hypochlorite was then discarded and the seeds rinsed 3 to 4 times with autoclaved
ddH2O. Finally 30 ml of ddH2O were added, and the seeds were incubated overnight
at 10 - 15°C in an incubator.

Agrobacterium-mediated transformation of white clover 

The seed coat and endosperm layer of the white clover seeds prepared as
above were removed with a pair of 18 G or 21 G needles. The cotyledons were cut
from the hypocotyl leaving a ca. 1.5 mm piece of the cotyledon stalk. The
cotyledons were transferred to a Petri dish containing ddH2O. After finishing the
isolation of clover cotyledons, the ddH2O in the Petri dish was replaced with
Agrobacterium suspension (diluted to an OD600 = 0.2 - 0.4). The Petri dish was
sealed with its lid and incubated for 40 min at room temperature.
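
Diluting the overnight culture into the stated OD600 window can be estimated as below. This is a minimal sketch that assumes optical density scales linearly with dilution; the measured OD600 of 1.5, the 50 ml final volume and the function name are hypothetical values chosen for illustration.

```python
def dilute_to_od(measured_od: float, target_od: float,
                 final_volume_ml: float) -> tuple[float, float]:
    """Return (culture_ml, diluent_ml) giving target_od in final_volume_ml.

    Assumes OD600 is proportional to cell density over the range used.
    """
    if target_od > measured_od:
        raise ValueError("culture is already more dilute than the target")
    culture_ml = target_od * final_volume_ml / measured_od
    return culture_ml, final_volume_ml - culture_ml


# Hypothetical reading of the overnight culture; target is the midpoint
# of the 0.2 - 0.4 window given in the text.
culture_ml, diluent_ml = dilute_to_od(measured_od=1.5, target_od=0.3,
                                      final_volume_ml=50.0)
print(f"{culture_ml:.1f} ml culture + {diluent_ml:.1f} ml diluent")
```

The diluted suspension should still be checked on a spectrophotometer, since dense cultures often deviate from linearity.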

After the incubation period, each cotyledon was transferred to paper towel
using the small dissection needle, dried and plated onto regeneration medium
RM73. The plates were incubated at 25°C with a 16 h light/8 h dark photoperiod.
On day 4, the explants were transferred to fresh regeneration medium. Cotyledons
transformed with Agrobacterium were transferred to RM73 containing cefotaxime
(antibacterial agent) and gentamycin. The dishes were sealed with Parafilm and
incubated at 25°C under a 16/8 h photoperiod. Explants were subcultured every
three weeks for a total of nine weeks onto fresh RM73 containing cefotaxime and
gentamycin. Shoots with a green base were then transferred to root-inducing
medium RIM. Roots developed after 1-3 weeks, and plantlets were transferred to
soil when the roots were well established.

This process is shown in detail in Figure 36. 

Preparation of genomic DNA for real-time PCR and analysis for the presence
of transgenes

3-4 leaves of white clover plants regenerated on selective medium were
harvested and freeze-dried. The tissue was homogenised on a Retsch MM300
mixer mill, then centrifuged for 10 min at 1700 x g to collect cell debris. Genomic
DNA was isolated from the supernatant using Wizard Magnetic 96 DNA Plant
System kits (Promega) on a Biomek FX (Beckman Coulter). 5 µl of the sample (50
µl) were then analysed on an agarose gel to check the yield and the quality of the
genomic DNA.

Genomic DNA was analysed for the presence of the transgene by real-time
PCR using SYBR Green chemistry. PCR primer pairs (Table 4) were designed
using MacVector (Accelrys) or PrimerExpress (ABI). The forward primer was
located within the 35S2 promoter region and the reverse primer within the
transgene to amplify products of approximately 150 - 250 bp as recommended.
The positioning of the forward primer within the 35S2 promoter region guaranteed
that endogenous genes in white clover were not detected.


TABLE 5

List of primers used for real-time PCR analysis of white clover plants
transformed with chimeric white clover genes involved in condensed tannin
biosynthesis

construct        primer 1 (forward), 5'->3'   primer 2 (reverse), 5'->3'

pPZP221TrCHSa3   CATTTCATTTGGAGAGGACACGC      AACACGGTTTGGTGGATTTGC
pPZP221TrCHSc    TTGGAGAGGACACGCTGAAATC       ACAAGTTGGTGAGGGAATGCC
pPZP221TrCHSf    CATTTCATTTGGAGAGGACACGC      TCGTTGCCTTTCCCTGAGTAGG
pPZP221TrCHSh    TCATTTGGAGAGGACACGCTG        CGGTCACCATTTTTTTGTTGGAGG
pPZP221TrBANa    TTGGAGAGGACACGCTGAAATC       CAACAAAACCAGTGCCACC
pPZP221TrLARa    ATGACGCACAATCCCACTATCC       AGCCTTAGAAGAGAGAAGAGGTCC
pPZP221TrLARb    ATGACGCACAATCCCACTATCC       AGCCTTAGAAGAGAGAAGAGGTCC
pPZP221TrLARc    ATGACGCACAATCCCACTATCC       AGCCTTAGAAGAGAGAAGAGGTCC
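
A reverse primer anneals to the strand complementary to the one printed, so its binding site is found by searching the printed sequence for the primer's reverse complement. The sketch below checks the pPZP221TrCHSa3 reverse primer against positions 151-250 of the first nucleotide sequence listing reproduced after the claims, which appears to be the corresponding CHS cDNA (an assumption, since the listing carries no label here). The function names are illustrative.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def binding_site(primer: str, template: str, template_start: int = 1):
    """Locate where a reverse primer anneals on the printed strand.

    Returns 1-based (start, end) coordinates, or None if there is no
    exact match. template_start is the coordinate of template[0].
    """
    site = reverse_complement(primer)
    offset = template.find(site)
    if offset < 0:
        return None
    start = template_start + offset
    return start, start + len(site) - 1


# Positions 151-250 of the first nucleotide listing in this document.
FRAGMENT_START = 151
FRAGMENT = (
    "AAGGCTCAGAGGGCTGAAGGCCCTGCAACCATTTTGGCCATTGGCACTGC"
    "AAATCCACCAAACCGTGTTGAGCAGAGCACATATCCTGATTTCTACTTCA"
)

# Reverse primer listed for pPZP221TrCHSa3 in the table above.
REVERSE_PRIMER = "AACACGGTTTGGTGGATTTGC"

site = binding_site(REVERSE_PRIMER, FRAGMENT, FRAGMENT_START)
print(site)  # (199, 219)
```

Only exact matches are reported; a mismatch-tolerant search would be needed to screen for off-target sites.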



5 µl of each genomic DNA sample was run in a 50 µl PCR reaction
including SYBR Green on an ABI 7700 (Applied Biosystems) together with
samples containing DNA isolated from wild type white clover plants (negative
control), samples containing buffer instead of DNA (buffer control) and samples
containing the plasmid used for transformation (positive plasmid control). Cycling
conditions used were 2 min. at 50 °C, 10 min. at 95 °C and then 40 cycles of 15
sec. at 95 °C, 1 min. at 60 °C.
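
The cycling programme above can be written down as data and its programmed hold time summed, as in the following sketch. It counts hold times only; temperature ramping between steps is instrument-dependent and is ignored here, so the real run takes longer. The variable names are illustrative.

```python
# (temperature in deg C, duration in seconds)
INITIAL_HOLDS = [(50, 2 * 60), (95, 10 * 60)]
CYCLE_STEPS = [(95, 15), (60, 60)]
N_CYCLES = 40

hold_seconds = sum(duration for _, duration in INITIAL_HOLDS)
cycle_seconds = sum(duration for _, duration in CYCLE_STEPS)
total_seconds = hold_seconds + N_CYCLES * cycle_seconds

print(f"initial holds: {hold_seconds / 60:.0f} min")
print(f"cycling: {N_CYCLES} x {cycle_seconds} s = "
      f"{N_CYCLES * cycle_seconds / 60:.0f} min")
print(f"total programmed hold time: {total_seconds / 60:.0f} min")
```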
Preparation of genomic DNA and analysis of DNA for presence and copy
number of transgene by Southern hybridisation blotting

Genomic DNA for Southern hybridisation blotting was obtained from leaf
material of white clover plants following the CTAB method. Southern hybridisation
blotting experiments were performed following standard protocols as described in
Sambrook et al. (1989). In brief, genomic DNA samples were digested with
appropriate restriction enzymes and the resulting fragments separated on an
agarose gel. After transfer to a membrane, a cDNA fragment representing a
transgene or selectable marker gene was used to probe the size-fractionated DNA
fragments. Hybridisation was performed with either radioactively labelled probes
or using the non-radioactive DIG labelling and hybridisation protocol (Boehringer)
following the manufacturer's instructions.

Plants were obtained after transformation with all chimeric constructs and
selection on medium containing gentamycin. Details of plant analysis are given in
Table 5 and Figures 37, 38 and 39.

Finally, it is to be understood that various alterations, modifications and/or
additions may be made without departing from the spirit of the present invention
as outlined herein.

CLAIMS

1. A substantially purified or isolated nucleic acid or nucleic acid
fragment encoding a condensed tannin biosynthetic enzyme selected from the
group consisting of chalcone synthase (CHS), CHS-like, dihydroflavonol 4-reductase
(BAN), BAN-like, leucoanthocyanidine reductase (LAR) and LAR-like, or
a functionally active fragment or variant of such a polypeptide, from a clover
(Trifolium), medic (Medicago), ryegrass (Lolium) or fescue (Festuca) species.

2. A nucleic acid or nucleic acid fragment according to Claim 1, wherein
said nucleic acid or nucleic acid fragment is from white clover (Trifolium repens) or
perennial ryegrass (Lolium perenne).

3. A nucleic acid or nucleic acid fragment according to Claim 1,
encoding a CHS polypeptide or CHS-like polypeptide and including a nucleotide
sequence selected from the group consisting of (a) sequences shown in Figures 2,
6, 10 and 14 hereto (Sequence ID Nos. 1, 3, 5 and 7, respectively); (b)
complements of the sequences recited in (a); (c) sequences antisense to the
sequences recited in (a) and (b); and (d) functionally active fragments and variants
of the sequences recited in (a), (b) and (c); and (e) RNA sequences corresponding
to the sequences recited in (a), (b), (c) and (d).

4. A nucleic acid or nucleic acid fragment according to Claim 1,
encoding a BAN polypeptide or BAN-like polypeptide and including a nucleotide
sequence selected from the group consisting of (a) sequence shown in Figure 18
hereto (Sequence ID No. 9); (b) complements of the sequence recited in (a); (c)
sequences antisense to the sequences recited in (a) and (b); and (d) functionally
active fragments and variants of the sequences recited in (a), (b) and (c); and (e)
RNA sequences corresponding to the sequences recited in (a), (b), (c) and (d).

5. A nucleic acid or nucleic acid fragment according to Claim 1,
encoding a LAR polypeptide or LAR-like polypeptide and including a nucleotide
sequence selected from the group consisting of (a) sequences shown in Figures
22, 26 and 30 hereto (Sequence ID Nos. 11, 13 and 15, respectively); (b)
complements of the sequences recited in (a); (c) sequences antisense to the
sequences recited in (a) and (b); and (d) functionally active fragments and variants
of the sequences recited in (a), (b) and (c); and (e) RNA sequences corresponding
to the sequences recited in (a), (b), (c) and (d).

6. A construct including one or more nucleic acid or nucleic acid 
fragments according to any one of claims 1 to 5. 

7. A construct according to claim 6 including nucleic acids or nucleic
acid fragments encoding both CHS or CHS-like and BAN or BAN-like
polypeptides.

8. A construct according to claim 6 including nucleic acids or nucleic 
acid fragments encoding both CHS or CHS-like and LAR or LAR-like polypeptides. 

9. A construct according to claim 6 including nucleic acids or nucleic
acid fragments encoding both LAR or LAR-like and BAN or BAN-like polypeptides.

10. A construct according to claim 6 including nucleic acids or nucleic 
acid fragments encoding all three of CHS or CHS-like, LAR or LAR-like and BAN 
or BAN-like polypeptides. 

11. A construct according to any one of claims 6 to 10 wherein the one
or more nucleic acids or nucleic acid fragments are operably linked to one or more
regulatory elements, such that the one or more nucleic acids or nucleic acid
fragments are expressed.

12. A construct according to Claim 11, wherein the one or more
regulatory elements include a promoter and a terminator, said promoter, nucleic
acid or nucleic acid fragment and terminator being operatively linked.

13. A plant cell, plant, plant seed or other plant part, including a construct 
according to any one of claims 6 to 12. 

14. A plant, plant seed or other plant part derived from a plant cell or
plant according to Claim 13.

15. A method of modifying one or more of condensed tannin
biosynthesis; protein binding; metal chelation; anti-oxidation; UV-light absorption;
pigment production; or plant defence to a biotic stress; in a plant, said method
including introducing into said plant an effective amount of a nucleic acid or
nucleic acid fragment according to any one of claims 1 to 5 or a construct
according to any one of claims 6 to 12.

16. A method according to claim 15 wherein said method includes
introducing into said plant effective amounts of nucleic acids or nucleic acid
fragments encoding both CHS or CHS-like and BAN or BAN-like polypeptides.

17. A method according to claim 15 wherein said method includes
introducing into said plant effective amounts of nucleic acids or nucleic acid
fragments encoding both CHS or CHS-like and LAR or LAR-like polypeptides.

18. A method according to claim 15 wherein said method includes 
introducing into said plant effective amounts of nucleic acids or nucleic acid 
fragments encoding both LAR or LAR-like and BAN or BAN-like polypeptides. 

19. A method according to claim 15 wherein said method includes
introducing into said plant effective amounts of nucleic acids or nucleic acid
fragments encoding all three of CHS or CHS-like, BAN or BAN-like and LAR or
LAR-like polypeptides.

20. A method according to any one of claims 15 to 19 wherein the
method is modifying plant defence to biotic stress and the biotic stress is selected
from the group consisting of viruses, micro-organisms, insects and fungal
pathogens.

21. A method of modifying forage quality of a plant by disrupting protein
foam and/or conferring protection from rumen pasture bloat, said method including
introducing into said plant an effective amount of a nucleic acid fragment
according to any one of claims 1 to 5 or a construct according to any one of claims
6 to 12.

22. A method according to claim 21 wherein said method includes
introducing into said plant effective amounts of nucleic acids or nucleic acid
fragments encoding both CHS or CHS-like and BAN or BAN-like polypeptides.

23. A method according to claim 21 wherein said method includes
introducing into said plant effective amounts of nucleic acids or nucleic acid
fragments encoding both CHS or CHS-like and LAR or LAR-like polypeptides.

24. A method according to claim 21 wherein said method includes
introducing into said plant effective amounts of nucleic acids or nucleic acid
fragments encoding both LAR or LAR-like and BAN or BAN-like polypeptides.

25. A method according to claim 21 wherein said method includes
introducing into said plant effective amounts of nucleic acids or nucleic acid
fragments encoding all three of CHS or CHS-like, BAN or BAN-like and LAR or
LAR-like polypeptides.

26. Use of a nucleic acid or nucleic acid fragment according to any one
of claims 1 to 5, and/or nucleotide sequence information thereof, and/or single
nucleotide polymorphisms thereof as a molecular genetic marker.

27. A substantially purified or isolated polypeptide from a clover
(Trifolium), medic (Medicago), ryegrass (Lolium) or fescue (Festuca) species,
selected from the group consisting of CHS and CHS-like, BAN and BAN-like and
LAR and LAR-like; and functionally active fragments and variants thereof.

28. A polypeptide according to Claim 27, wherein said polypeptide is 
from white clover (Trifolium repens) or perennial ryegrass (Lolium perenne). 

29. A polypeptide according to Claim 27, wherein said polypeptide is
CHS or CHS-like and includes an amino acid sequence selected from the group
consisting of sequences shown in Figures 3, 7, 11 and 15 hereto (Sequence ID
Nos. 2, 4, 6 and 8, respectively); and functionally active fragments and variants
thereof.

30. A polypeptide according to Claim 27, wherein said polypeptide is
BAN or BAN-like and includes an amino acid sequence selected from the group
consisting of sequence shown in Figure 19 hereto (Sequence ID No. 10); and
functionally active fragments and variants thereof.

31. A polypeptide according to Claim 27, wherein said polypeptide is
LAR or LAR-like and includes an amino acid sequence selected from the group
consisting of sequences shown in Figures 23, 27 and 31 hereto (Sequence ID
Nos. 12, 14 and 16, respectively); and functionally active fragments and variants
thereof.

1 GAATTCACTA GTGATTAAGC AGTGGTAACA ACGCAGAGTA CGCGGGGAAC
51 AAAAACAACT ACGCATATTA TATATATATA TATATAGTCT ATAATTGAAA
101 GAAACTGCTA AAGATATTAT TAAGATATGG TGAGTGTAGC TGAAATTCGC
151 AAGGCTCAGA GGGCTGAAGG CCCTGCAACC ATTTTGGCCA TTGGCACTGC
201 AAATCCACCA AACCGTGTTG AGCAGAGCAC ATATCCTGAT TTCTACTTCA
251 AAATTACAAA CAGTGAGCAC AAGACTGAGC TCAAAGAGAA GTTCCAACGC
301 ATGTGTGACA AATCCATGAT CAAGAGCAGA TACATGTATC TAACAGAAGA
351 GATTTTGAAA GAAAATCCTA GTCTTTGTGA ATACATGGCA CCTTCATTGG
401 ATGCTAGGCA AGACATGGTG GTGGTTGAGG TACCTAGACT TGGGAAGGAG
451 GCTGCAGTCA AGGCCATTAA AGAATGGGGT CAACCAAAGT CAAAGATTAC
501 TCACTTAATC TTTTGCACCA CAAGTGGTGT TGACATGCCT GGTGCTGATT
551 ACCAACTCAC AAAACTCTTA GGTCTTCGCC CATATGTGAA AAGGTATATG
601 ATGTACCAAC AAGGTTGTTT TGCAGGAGGC ACGGTGCTTC GTTTGGCAAA
651 AGATTTGGCC GAGAACAACA AAGGTGCTCG TGTGCTAGTT GTTTGTTCTG
701 AAGTCACCGC AGTCACATTT CGCGGCCCCA GTGATACTCA CTTGGACAGT
751 CTTGTTGGAC AAGCATTGTT TGGAGATGGA GCCGCTGCAC TAATTGTTGG
801 TTCTGATCCA GTGCCTGAAA TTGAGAAACC AATATTTGAG ATGGTTTGGA
851 CTGCACAAAC AATTGCTCCA GACAGTGAAG GTGCCATTGA TGGTCATCTT
901 [sequence illegible]
951 ATCAAAGAAC ATTAATAAAG CATTGGTTGA GGCTTTCCAA CCATTAGGAA
1001 TTTCTGACTA CAACTCAATC TTTTGGATTG CACACCCGGG TGGACCTGCA
1051 ATTCTTGATC AAGTAGAACA AAAGCTAGCC TTGAAGCCCG AAAAGATGAG
1101 GGCCACGAGG GAAGTTCTAA GTGAATATGG AAACATGTCA AGCGCATGTG
1151 TATTGTTCAT CTTAGATGAG ATGCGGAAGA AATCGGCTCA AAATGGACTT
1201 AAGACAACTG GAGAAGGACT TGATTGGGGT GTGTTGTTCG GCTTCGGACC
1251 AGGACTTACC ATTGAAACCG TTGTTCTTCG TAGCGTGGCT ATATAAGATG
1301 TGTGATTGTT TTTATTTTAA TGTATTACTT TTAATCTTGC TGCCTTGAAT
1351 TTCGATTTAA GAATAAATAA ATATATCTTT TGATAAAAAA AAAAAAAAAA
1401 AAAAAAAAAA AAGTACTCTG CGTTGTTACC ACTGCTTAAT CGAATTC

1 MVSVAEIRKA QRAEGPATIL AIGTANPPNR VEQSTYPDFY FKITNSEHKT
51 ELKEKFQRMC DKSMIKSRYM YLTEEILKEN PSLCEYMAPS LDARQDMVVV
101 EVPRLGKEAA VKAIKEWGQP KSKITHLIFC TTSGVDMPGA DYQLTKLLGL
151 RPYVKRYMMY QQGCFAGGTV LRLAKDLAEN NKGARVLVVC SEVTAVTFRG
201 PSDTHLDSLV GQALFGDGAA ALIVGSDPVP EIEKPIFEMV WTAQTIAPDS
251 EGAIDGHLRE AGLTFHLLKD VPGIVSKNIN KALVEAFQPL GISDYNSIFW
301 IAHPGGPAIL DQVEQKLALK PEKMRATREV LSEYGNMSSA CVLFILDEMR
351 KKSAQNGLKT TGEGLDWGVL FGFGPGLTIE TVVLRSVAI

1 GAATTCGATT AAGCAGTGGT AACAACGCAG AGTACGCGGG GATTCAATCT
51 GTTGTGCATA AAATTCACTC ATTGCATAGA AAACCATACA CATTTGATCT
101 TGCAAAGAAG AAATATGGGA GACGAAGGTA TAGTGAGAGG TGTCACAAAG
151 CAGACAACCC CTGGGAAGGC TACTATATTG GCTCTTGGCA AGGCATTCCC
201 TCACCAACTT GTGATGCAAG AGTGTTTAGT TGATGGTTAT TTTAGGGACA
251 CTAATTGTGA CAATCCTGAA CTTAAGCAGA AACTTGCTAG ACTTTGTAAG
301 ACAACCACGG TAAAAACAAG GTATGTTGTT ATGAATGAGG AGATACTAAA
351 GAAATATCCA GAACTTGTTG TCGAAGGCGC CTCAACTGTA AAACAACGTT
401 TAGAGATATG TAATGAGGCA GTAACACAAA TGGCAATTGA AGCTTCCCAA
451 GTTTGCCTAA AGAATTGGGG TAGATCCTTA TCGGACATAA CTCATGTGGT
501 TTATGTTTCA TCTAGTGAAG CTAGATTACC CGGTGGTGAC CTATACTTGT
551 CAAAAGGACT AGGACTAAAC CCTAAAATTC AAAGAACCAT GCTCTATTTC
601 TCTGGATGCT CGGGAGGCGT AGCCGGCCTT CGCGTTGCGA AAGACGTAGC
651 TGAGAACAAC CCTGGAAGTA GAGTTTTGCT TGCTACTTCG GAAACTACAA
701 TTATTGGATT CAAGCCACCA AGTGTTGATA GACCTTATGA TCTTGTTGGT
751 GTGGCACTCT TTGGAGATGG TGCTGGTGCA ATGATAATTG GCTCAGACCC
801 GGTATTTGAA ACTGAGACAC CATTGTTTGA GCTGCATACT TCAGCTCAGG
851 AGTTTATACC AGACACCGAG AAGAAAATTG ATGGGCGGCT GACGGAGGAG
901 GGCATAAGTT TCACACTAGC AAGGGAACTT CCGCAGATAA TCGAAGACAA
951 TGTTGAGGGA TTCTGTAATA AACTAATTGA TGTTGTTGGG TTGGAGAATA
1001 AGGAGTACAA TAAGTTGTTT TGGGCTGTGC ATCCAGGTGG GCCTGCGATA
1051 TTGAATCGCG TGGAGAAGCG GCTTGAGTTG TCGCCGCAGA AGCTGAATGC
1101 TAGTAGAAAA GCTCTAATGG ATTATGGAAA TGCTAGCAGC AATACTATTG
1151 TTTATGTGCT GGAATATATG CTAGAAGAGG AAAAGAAGAT TAAAAAGGCG
1201 GGTGGAGGAG ATTCTGAATG GGGATTGATA CTTGCTTTTG GACCTGGAAT
1251 TACTTTTGAG GGGATTCTAG CAAGGAACTT GTGTGCATGA AGTCTTATAC
1301 AATTGTGATG CATGACTTAT ACTCTTATTT CTACTAATTA TTATATTAAG
1351 CAAATTCAGA ACTTTTAAGT AATGATTTAA TGAAGAATAC TTATAGTATA
1401 [sequence illegible]
1451 ACTTGAGCAT GTGGAATAGT TGTAACAAAA ACTCTAAGCA AATAGAGACT
1501 TTATGTAGTA TAAAGCATTT CCAGACATGA TAAATAATGG TACCTCAGAA
1551 CATAAAATAT ATTTAGCTAT CTTTCATCCC CAACTTTACA CATCCACCAA
1601 GGTACAGAAT AAGCATATGT CAACACAAAA TGTACTCTAA GTCTAACATG
1651 AGTAACCAAA CATGATGCCT GATTAAGTTA AAAGAAAAGA AAATCTGAGG
1701 GCATAGATCT TCAATCACAC CACTCCAGAG GGAAGGCGTA GAACAAGCTG
1751 TCCGCCGAAA ACACTGCAAT TCAATAAATA TCATTAGGAC AACAGTGCAG
1801 AGTCATGCGG GAAATGTCTT AAGTCACTGT ACTAAAAATA TAGGATTATA
1851 [sequence illegible]
1901 ATGAATGTCT GGACAATTTC TGAGATAAGA ACCATGACGG CCATAAGCCA
1951 TACCCCAAGG CAACCAATAA ATGTCCACGG GTATCTAACA CCTGTTGCAA
2001 GAAATAGTAA GTTATTAGGA GATGTGCGGT TACGAAATTC AAGCTACACA
2051 ACAAAAGGAG GCCAGAACAA CAGCAATCTT GTAACCAGAT GACAACAATA
2101 AAATGTAAAC TTAAAGAGAC CGAACACACA AACATTGCAA CTCAGATGGA
2151 ATTGCTGCCA TGTAACTAGT AGGAGATTTG GGACGTCAAA TCAGTATATT
2201 ATGCAAATAC AAGGTATGAC CGCCTTGTCT ATTGTAGCAT ACAACAAACG
2251 TACAGTGGGT TTGTCCCTCT CAAAATGGCA GGATCTTTAC AGCACAATAT
2301 TTGGTTTTGT CATACTTATA CCATAAAAAA AAAAAAAAAA AAAAAAAAAA
2351 AAAGTACTCT GCGTTGTTAC CACTGCTTAA TCACTAGTGA ATTC

1 MGDEGIVRGV TKQTTPGKAT ILALGKAFPH QLVMQECLVD GYFRDTNCDN
51 PELKQKLARL CKTTTVKTRY VVMNEEILKK YPELVVEGAS TVKQRLEICN
101 EAVTQMAIEA SQVCLKNWGR SLSDITHVVY VSSSEARLPG GDLYLSKGLG
151 LNPKIQRTML YFSGCSGGVA GLRVAKDVAE NNPGSRVLLA TSETTIIGFK
201 PPSVDRPYDL VGVALFGDGA GAMIIGSDPV FETETPLFEL HTSAQEFIPD
251 TEKKIDGRLT EEGISFTLAR ELPQIIEDNV EGFCNKLIDV VGLENKEYNK
301 LFWAVHPGGP AILNRVEKRL ELSPQKLNAS RKALMDYGNA SSNTIVYVLE
351 YMLEEEKKIK KAGGGDSEWG LILAFGPGIT FEGILARNLC A

1 GAATTCGATT AAGCAGTGGT AACAACGCAG AGTACGCGGG ACTAAGCCTT
51 GATTCATTGT TTGTTTCCAT AACACAAGAA CTAGTGTTTG CTTGAATCTT
101 AAGAAAAAAT GCCTCAAGGT GATTTGAATG GAAGTTCCTC GGTGAATGGA
151 GCACGTGCTA GACGTGCTCC TACTCAGGGA AAGGCAACGA TACTTGCATT
201 AGGAAAGGCT TTCCCCGCCC AGGTCCTCCC TCAAGAGTGC TTGGTGGAAG
251 GATTCATTCG CGACACTAAG TGTGACGATA CTTATATTAA GGAGAAATTG
301 GAGCGTCTTT GCAAAAACAC AACTGTGAAA ACAAGATACA CAGTAATGTC
351 AAAGGAGATC TTAGACAACT ATCCAGAGCT AGCCATAGAT GGAACACCAA
401 CAATAAGGCA AAAGCTTGAA ATAGCAAATC CAGCAGTAGT TGAAATGGCA
451 ACAAGAGCAA GCAAAGATTG CATCAAAGAA TGGGGAAGGT CACCTCAAGA
501 TATCACACAC ATAGTCTATG TTTCCTCGAG CGAAATTCGT CTACCCGGTG
551 GTGACCTTTA TCTTGCAAAT GAACTCGGCT TAAACAGCGA TGTTAATCGC
601 GTAATGCTCT ATTTCCTCGG TTGCTACGGC GGTGTCACTG GCTTACGTGT
651 CGCCAAAGAC ATCGCCGAAA ATAACCCTGG TAGTAGGGTG TTACTCACAA
701 CATCCGAGAC CACTATTCTC GGTTTTCGAC CACCGAGTAA AGCTAGACCT
751 TATGACCTCG TTGGCGCTGC ACTTTTCGGT GATGGCGCCG CTGCTGCAAT
801 AATTGGAACA GACCCTATAT TGAATCAAGA ATCACCTTTC ATGGAATTGA
851 ACCATGCAGT CCAAAAATTC TTGCCTGATA CACAAAATGT GATTGATGGT
901 AGAATCACTG AAGAGGGTAT TAATTTTAAG CTTGGAAGAG ACCTTCCTCA
951 AAAAATTGAA GACAATATTG AAGAATTTTG CAAGAAAATT ATGGCTAAAA
1001 GTGATGTTAA GGAATTTAAT GACTTATTTT GGGCTGTTCA TCCTGGTGGG
1051 CCAGCTATAC TCAATAAGCT AGAAAATATA CTCAAATTGA AAAGTGATAA
1101 ATTGGATTGT AGTAGGAAGG CATTAATGGA TTATGGAAAT GTTAGTAGCA
1151 ATACTATATT CTATGTGATG GAGTATATGA GAGATTATTT GAAGGAAGAT
1201 GGAAGTGAAG AATGGGGATT AGGATTGGCT TTTGGACCAG GGATTACTTT
1251 TGAAGGGGTT CTCCTCCGTA GCCTTTAATC TTGAAATAAT AATTCATATG
1301 AAATTACTTG TCTTAAGATT GTGATAGGAA GATGAATATG TATTGGATTA
1351 ATATTGATAT GGTGTTATTT TAAGTTGATT TTAAAAAAAG TTTATTAATA
1401 AAGTATGATG TAACAATTGT TGTTTGAATG TTAAAAGGGA AGTATACTAT
1451 TTTAAGTTCT TGACCATACT GATTTTTTCT TTACACATTT TCATATCTAA
1501 AATTGTTCTA TGATATCTTC ATTGTTGATA CTGTAATAAT ATAATATCTA
1551 ATTTGGCTGG CAAAATGAAA GATTTTTCAC CGAAAAAAAA AAAAAAAAAA
1601 AAAAAAAAAA AAGTACTCTG CGTTGTTACC ACTGCTTAAT CACTAGTGAA
1651 TTC

1 MPQGDLNGSS SVNGARARRA PTQGKATILA LGKAFPAQVL PQECLVEGFI
51 RDTKCDDTYI KEKLERLCKN TTVKTRYTVM SKEILDNYPE LAIDGTPTIR
101 QKLEIANPAV VEMATRASKD CIKEWGRSPQ DITHIVYVSS SEIRLPGGDL
151 YLANELGLNS DVNRVMLYFL GCYGGVTGLR VAKDIAENNP GSRVLLTTSE
201 TTILGFRPPS KARPYDLVGA ALFGDGAAAA IIGTDPILNQ ESPFMELNHA
251 VQKFLPDTQN VIDGRITEEG INFKLGRDLP QKIEDNIEEF CKKIMAKSDV
301 KEFNDLFWAV HPGGPAILNK LENILKLKSD KLDCSRKALM DYGNVSSNTI
351 FYVMEYMRDY LKEDGSEEWG LGLAFGPGIT FEGVLLRSL

1 GAATTCACTA GTGATTAAGC AGTGGTAACA ACGCAGAGTA CGCGGGGGAA
51 TCCACCAAAT CAACACCATT AATAACCTTC CAAATTCTCG TTACCTCACC
101 AAATCTCATT TTTCATTATA TATCTTGGGT ACATCTTTTG TTACCTCCAA
151 CAAAAAAATG GTGACCGTAG AAGAGATTCG TAACGCCCAA CGTTCAAATG
201 GCCCTGCCAC TATCTTAGCT TTTGGCACAG CCACTCCTTC TAACTGTGTC
251 ACTCAAGCTG ATTATCCTGA TTACTACTTT CGTATCACCA ACAGCGAACA
301 TATGACTGAT CTTAAGGAAA AATTCAAGCG GATGTGTGAT AGATCAATGA
351 TAAAGAAACG TTACATGCAC CTAACAGAAG ACTTTCTGAA GGAGAATCCA
401 AATATGTGTG AATACATGGC ACCATCACTA GATGTAAGAC GAGACATAGT
451 GGTTGTTGAA GTACCAAAGC TAGGTAAAGA AGCAGCAAAA AAAGCCATAT
501 GTGAATGGGG ACAACCAAAA TCCAAAATCA CACATCTTGT TTTCTGCACC
551 ACTTCCGGTG TTGACATGCC GGGAGCCGAT TACCAACTCA CCAAACTTTT
601 AGGCTTAAAA CCTTCTGTCA AGCGTCTCAT GATGTATCAA CAAGGTTGTT
651 TCGCTGGCGG CACAGTTCTC CGCTTAGCAA AAGACCTTGT TGAGAATAAC
701 AAAAATGCAA GAGTTCTTGT TGTTTGTTCT GAAATTACTG CGGTTACTTT
751 TCGTGGACCA TCGGATACTC ATCTTGATTC GCTCGTGGGA CAGGCGCTTT
801 TTGGTGATGG AGCCGCAGCA ATGATTATTG GTGCGGATCC TGATTTAACC
851 GTGGAGCGTC CGATTTTCGA GATTGTTTCG GCTGCTCAGA CTATTCTTCC
901 TGATTCTGAT GGCGCAATTG ATGGACATCT TCGTGAAGTG GGGCTCACTT
951 TTCATTTATT GAAAGATGTT CCGGGGATTA TTTCAAAGAA CATTGAAAAA
1001 AGTTTAGTTG AAGCTTTTGC GCCTATTGGG ATTAATGATT GGAACTCAAT
1051 ATTTTGGGTT GCACATCCAG GTGGACCGGC TATTTTAGAC CAGGTTGAAG
1101 AGAAACTCCA TCTTAAAGAG GAGAAACTCC GGTCCACCCG GCATGTGCTT
1151 AGTGAATATG GAAATATGTC AAGTGCATGT GTTTTATTTA TTTTGGATGA
1201 AATGAGAAAG AGGTCTAAAG AGGAAGGGAT GATTACAACT GGTGAAGGGT
1251 TGGAATGGGG TGTGTTGTTT GGGTTTGGAC CGGGTTTAAC TGTTGAAACC
1301 GTTGTGCTTC ATAGTGTTCC GGTTCAGGGT TGAATTTATT ATACATAGAT
1351 TGGAAAATAA AATTTGCCTG CCGAGAGATG TGAACTAACT TTGTAGGCAA
1401 GCTCAAATTA AAGTTTGAGA TAATATTGTG CTTTAGTTAT TATGGTATGT
1451 AATGTAATGT TTTTACTTTT TTCGAAATTC ATGTAATTTG ATATGTAAAG
1501 TAATATGTTT GGGTTGGAAT ATAATTATTT GTTAACTAAA AAAAAAAAAA
1551 AAAAAAAAAA AAAAAGTACT CTGCGTTGTT ACCACTGCTT AATCGAATTC

1 MVTVEEIRNA QRSNGPATIL AFGTATPSNC VTQADYPDYY FRITNSEHMT
51 DLKEKFKRMC DRSMIKKRYM HLTEDFLKEN PNMCEYMAPS LDVRRDIVVV
101 EVPKLGKEAA KKAICEWGQP KSKITHLVFC TTSGVDMPGA DYQLTKLLGL
151 KPSVKRLMMY QQGCFAGGTV LRLAKDLVEN NKNARVLVVC SEITAVTFRG
201 PSDTHLDSLV GQALFGDGAA AMIIGADPDL TVERPIFEIV SAAQTILPDS
251 DGAIDGHLRE VGLTFHLLKD VPGIISKNIE KSLVEAFAPI GINDWNSIFW
301 VAHPGGPAIL DQVEEKLHLK EEKLRSTRHV LSEYGNMSSA CVLFILDEMR
351 KRSKEEGMIT TGEGLEWGVL FGFGPGLTVE TVVLHSVPVQ G

pPZP221:35S2 TrCHSh anti

1 GAATTCGATT AAGCAGTGGT AACAACGCAG AGTACGCGGG ATAAAAACTG
51 CACTAGTGTG TATAAGTTTC TTGGTGAAAA AAGAGTTTGT AAATTAACAT
101 CATGGCTAGT ATCAAACAAA TTGGAAACAA GAAAGCATGT GTGATTGGTG
151 GCACTGGTTT TGTTGCATCT ATGTTGATCA AGCAGTTACT TGAAAAGGGT
201 TATGCTGTTA ATACTACCGT TAGAGACCCA GATAGCCCTA AGAAAATATC
251 TCACCTAGTG GCACTGCAAA GTTTGGGGGA ACTGAATCTA TTTAGAGCAG
301 ACTTAACAGT TGAAGAAGAT TTTGATGCTC CTATAGCAGG ATGTGAACTT
351 GTTTTTCAAC TTGCTACACC TGTGAACTTT GCTTCTCAAG ATCCTGAGAA
401 TGACATGATA AAGCCAGCAA TCAAAGGTGT GTTGAATGTG TTGAAAGCAA
451 TTGCAAGAGC AAAAGAAGTT AAAAGAGTTA TCTTAACATC TTCGGCAGCC
501 GCGGTGACTA TAAATGAACT CAAAGGGACA GGTCATGTTA TGGATGAAAC
551 CAACTGGTCT GATGTTGAAT TTCTCAACAC TGCAAAACCA CCCACTTGGG
601 GTTATCCTGC CTCAAAAATG CTAGCTGAAA AGGCTGCATG GAAATTTGCT
651 GAAGAAAATG ACATTGATCT AATCACTGTG ATACCTAGTT TAACAACTGG
701 TCCTTCTCTC ACACCAGATA TCCCATCTAG TGTTGGCTTG GCAATGTCTC
751 TAATAACAGG CAATGATTTT CTCATAAATG CTTTGAAAGG AATGCAGTTT
801 CTGTCGGGTT CGTTATCCAT CACTCATGTT [illegible] [illegible]
851 TATATTTCTT GCAGAGAAAG AATCAGCTTC TGGTAGATAC ATTTGCTGTG
901 CTCACAATAC TAGTGTTCCC GAGCTTGCAA AGTTTCTCAA CAAACGATAT
951 CCTCAGTATA AAGTTCCAAC TGAATTTGAT GATTGCCCCA GCAAGGCAAA
1001 GTTGATAATC TCTTCTGAAA AGCTTATCAA AGAAGGGTTC AGTTTCAAGC
1051 ATGGTATTGC CGAAACTTTC GACCAGACTG TCGAGTATTT TAAGACTAAG
1101 GGGGCACTGA AGAATTAGAT TTTGATATTT CTAATTCAAT AGCAAACTCT
1151 AAGCTTGTTA TGTGTTTGTG AAGTTCAGAG TGAAATATCA AATGAATAAG
1201 TGGAGAGAGC ACAATAAGAG GAGAGCACAA TAATTTTGGA AAAAAAAAAA
1251 AAAAAAAAAA AAAAAAAAGT ACTCTGCGTT GTTACCACTG CTTAATCACT
1301 AGTGAATTC

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1 MASIKQIGNK KACVIGGTGF VASMLIKQLL E KG YAVNT TV RDPDSPKKIS 

51 HLVALQSLGE LNLFRADLTV EEDFDAPIAG CELVFQLATP VNFASQDPEN

101 DMIKPAIKGV LNVLKAIARA KEVKRVILTS SAAAVTINEL KGTGHVMDET

151 NWSDVEFLNT AKPPTWGYPA SKMLAEKAAW KFAEENDIDL ITVIPSLTTG

201 PSLTPDIPSS VGLAMSLITG NDFLINALKG MQFLSGSLSI THVEDICRAH

251 IFLAEKESAS GRYICCAHNT SVPELAKFLN KRYPQYKVPT EFDDCPSKAK

301 LIISSEKLIK EGFSFKHGIA ETFDQTVEYF KTKGALKN
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pPZP221 :35S 2 TrBANa sense 




pPZP221 :35S 2 TrBANa anti 
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TrLARa 
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1 GAATTCGATT AAGCAGTGGT AACAACGCAG AGTACGCGGG GATACCAACA
51 TTGTCACAAT TAACTCTAAA AGCAAAGCAA TGGCACCAGC AGCAACATCA
101 TCACCAACCA CTCCTACTAC TACCAAGGGT CGTGTCCTAA TTGTTGGAGG
151 AACAGGTTTC ATTGGAAAAT TTGTAACTGA GGCAAGTCTT TCCACAACAC
201 ACCCAACCTA CTTGTTGGTT CGGCCAGGAC CTCTTCTCTC TTCTAAGGCT
251 GCCACTATTA AGGCATTCCA AGAGAAAGGT GCCATTGTCA TTTATGGTCG
301 GGTAAATAAT AAGGAGTTCA TGGAGATGAT TTTGAAAAAG TATGAGATAA
351 ATGTAGTCAT TTCTGCAATA GGAGGCTCTG ATGGCTTGCT GGAACAGCTT
401 ACTTTGGTGG AGGCCATGAA ATCTATTAAC ACCATTAAGA GGTTTTTGCC
451 TTCGGAATTT GGTCACGATG TGGACAGAGC AAATCCTGTG GAACCTGGCC
501 TAACAATGTA CAAACAGAAA CGTTTGGTTA GACGTGTGAT CGAAGAATCT
551 GGTATACCAT ACACCTACAT CTGTTGCAAT TCGATCGCAT CTTGGCCGTA
601 CTATGACAAT TGTCATCCAT CACAGCTTCC TCCACCGTTG GATCAATTAC
651 ATATTTATGG TCATGGCGAT GTCAAAGCTT ACTTTGTTGA TGGCTATGAT
701 ATTGGGAAAT TCACAATGAA GGTCATTGAT GATGAAAGAA CAATCAACAA
751 AAATGTTCAT TTTCGACCTT CTAACAATTG TTATAGCATG AATGAGCTTG
801 CTTCTTTGTG GGAAAACAAA ATTGCACGAA AAATTCCTAG AGTGATCGTC
851 TCTGAAGACG ATCTTCTAGC AATAGCCGCA GAAAATTGCA TACCGGAAAG
901 TGTCGTGGCA CCAATCACTC ATGATATATT CATCAATGGA TGTCAAGTTA
951 ACTTCAAGAT AGATGGAATT CATGATGTTG AAATTGGCAC TCTATATCCT
1001 GGTGAATCGG TAAGAAGTTT GGAGGAATGC TATGAGAAAT TTGTTGTCAT
1051 GGCGGCTGAC AAGATTCATA AAGAAGAAAC TGGAGTTACC GCAGGTGGGG
1101 GCGGCACAAC GGCTATGGTA GAGCCGGTGC CAATCACAGC TTCCTGTTGA
1151 AAAGGTTCAC CTGAGGTGGA TATTCTTTTG AGTCATAAGA CATGTTGATT
1201 GTTGATGTTG TTTTCAAGAA TGTTTCATCA TTTCATGTGT TTTATTAATC
1251 CTAAGTACAA ATAATTGCTG TCTACGTACG TTCTTAGTTG CAAAAATTCT
1301 TGTTATTCTC TATTGAGGTA AAAGTCTTCA TGTTTACAAA AAAAAAAAAA
1351 AAAAAAAAAA AAAAAAAAGT ACTCTGCGTT GTTACCACTG CTTAATCACT
1401 AGTGAATTC
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1 MAPAATSSPT TPTTTKGRVL IVGGTGFIGK FVTEASLSTT HPTYLLVRPG

51 PLLSSKAATI KAFQEKGAIV IYGRVNNKEF MEMILKKYEI NVVISAIGGS

101 DGLLEQLTLV EAMKSINTIK RFLPSEFGHD VDRANPVEPG LTMYKQKRLV

151 RRVIEESGIP YTYICCNSIA SWPYYDNCHP SQLPPPLDQL HIYGHGDVKA

201 YFVDGYDIGK FTMKVIDDER TINKNVHFRP SNNCYSMNEL ASLWENKIAR

251 KIPRVIVSED DLLAIAAENC IPESVVAPIT HDIFINGCQV NFKIDGIHDV

301 EIGTLYPGES VRSLEECYEK FVVMAADKIH KEETGVTAGG GGTTAMVEPV

351 PITASC
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pPZP221:35S 2 TrLARa sense 




pPZP221:35S 2 TrLARa anti 
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TrLARb 
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1 GAATTCGATT AAGCAGTGGT AACAACGCAG AGTACGCGGG AGGATCCTTC
51 CATTTTGCAT ACCAACATTG TCACAATTAA CTCTAAAAGC AAAGCAATGG
101 CACCAGCAGC AACATCATCA CCAACCACTC CTACTACTAC CAAGGGTCGT
151 GTCCTAATTG TTGGAGGAAC AGGTTTCATT GGAAAATTTG TAACTGAGGC
201 AAGTCTTTCC ACAACACACC CAACCTACTT GTTGGTTCGG CCAGGACCTC
251 TTCTCTCTTC TAAGGCTGCC ACTATTAAGG CATTCCAAGA GAAAGGTGCC
301 ATTGTCATTT ATGGTCGGGT AAATAATAAG GAGTTCATGG AGATGATTTT
351 GAAAAAGTAT GAGATAAATG TAGTCATTTC TGCAATAGGA GGCTCTGATG
401 GCTTGCTGGA ACAGCTTACT TTGGTGGAGG CCATGAAATC TATTAACACC
451 ATTAAGAGGT TTTTGCCTTC AGAATTTGGT CACGATGTGG ACAGAGCAAA
501 TCCTGTGGAA CCTGGCCTAA CAATGTACAA ACAGAAACGT TTGGTTAGAC
551 GTGTGATCGA AGAATCTGGT GTACCATACA CCTACATCTG TTGCAATTCG
601 ATCGCATCCT GGCCGTACTA TGACAATTGT CATCCATCAC AGCTTCCTCC
651 ACCGTTGGAT CAATTACATA TTTATGGTCA TGGCGATGTC AAAGCTTACT
701 TTGTTGATGG CTATGATATT GGGAAATTCA CAATGAAGGT CATTGATGAT
751 GAAAGAACAA TCAACAAAAA TGTTCATTTT CGACCTTCTA ACAATTGTTA
801 TAGCATGAAT GAGCTTGCTT CTTTGTGGGA AAACAAAATT GCACGAAAAA
851 TTCCTAGAGT GATCGTCTCT GAAGACGATC TTCTAGCAAT AGCCGCAGAA
901 AACTGCATAC CGGAAAGTGT TGTGGCATCA ATCACTCATG ATATATTCAT
951 CAATGGATGT CAAGTTAACT TCAAGGTAGA TGGAATTCAT GATGTTGAAA
1001 TTGGCACTCT ATATCCTGGT GAATCGGTAA GAAGTTTGGA GGAATGCTAT
1051 GAGAAATTTG TTGTCATGGC GGCTGACAAG ATTCATAAAG AAGAAACTGG
1101 AGTTACCGCA GGTGGGGGCG GCACAACGGC TATGGTAGAG CCGGTGCCAA
1151 TCACAGCTTC CTGTTGAAAA GGTTCACCTG AGGTGGATAT TCTTTTGAGT
1201 CATAAGACAT GTTGATTGTT GATGTTGTTT TCAAGAATGT TTCATCATTT
1251 CATGTGTTTT ATTAATCCTA AGTACAAATA ATTGCTGTCT ACGTACGTTC
1301 TTAGTTGCGA AAATTCTTGT TATTCTCTAT TGGGGTAAAA GTCTTCATGT
1351 TTATTGTAGT TGTGTTGGTT TTTCATATAT GCTATTTGCA ATAATGATTT
1401 TTGTGAAGCA CTTGTGGTGT ATTTACTTAC TACTGAAAAT AATGGTTACA
1451 CAAAATATAT AAAAAAATAA AAATAAGCAA AAAAAAAAAA AAAAAAAAAA
1501 AAAAAAAAAA GTACTCTGCG TTGTTACCAC TGCTTAATCA CTAGTGAATT
1551 C
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1 MAPAATSSPT TPTTTKGRVL IVGGTGFIGK FVTEASLSTT HPTYLLVRPG

51 PLLSSKAATI KAFQEKGAIV IYGRVNNKEF MEMILKKYEI NVVISAIGGS

101 DGLLEQLTLV EAMKSINTIK RFLPSEFGHD VDRANPVEPG LTMYKQKRLV

151 RRVIEESGVP YTYICCNSIA SWPYYDNCHP SQLPPPLDQL HIYGHGDVKA

201 YFVDGYDIGK FTMKVIDDER TINKNVHFRP SNNCYSMNEL ASLWENKIAR

251 KIPRVIVSED DLLAIAAENC IPESVVASIT HDIFINGCQV NFKVDGIHDV

301 EIGTLYPGES VRSLEECYEK FVVMAADKIH KEETGVTAGG GGTTAMVEPV

351 PITASC
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pPZP221:35S 2 TrLARb sense

[Plasmid map labels: 35S terminator, aacC1, 35S promoter, 35S 2 promoter, attR2, TrLARb, attR1, terminator]

pPZP221:35S 2 TrLARb anti
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FIGURE 29 
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1 GAATTCGATT AAGCAGTGGT AACAACGCAG AGTACGCGGG GATACCAACA 
51 TTGTCACAAT TAACTCTAAA AGTAAAGCAA TGGCACCAGC AGCAACATCA
101 TCACCAACCA CTCCCACTAC TACCAAGGGT CGTGTCCTAA TTGTTGGAGG
151 AACAGGTTTC ATTGGAAAAT TTGTAACTGA GGCAAGTCTT TCCACAACAC
201 ACCCAACCTA CTTGTTGGTT CGGCCAGGAC CTCTTCTCTC TTCTAAGGCT
251 GCCACTATTA AGGCATTCCA AGAGAAAGGT GCCATTGTCA TTTATGGTCG
301 GGTAAATAAT AAGGAGTTCA TGGAGATGAT TTTGAAAAAG TATGAGATAA
351 ATGTAGTCAT TTCTGCAATA GGAGGCTCTG ATGGCTTGCT GGAACAGCTT
401 ACTTTGGTGG AGGCCATGAA ATCTATTAAC ACCATTAAGA GGTTTTTGCC
451 TTCGGAATTT GGTCACGATG TGGACAGAGC AGATCCTGTG GAACCTGGCC
501 TAACAATGTA CAAACAGAAA CGTTTGGTTA GACGTGTGAT CGAAGAATCT
551 GGTATACCAT ACACCTACAT CTGTTGCAAT TCGATCGCAT CTTGGCCGTA
601 CTATGACAAT TGTCATCCAT CACAGCTTCC TCCACCGTTG GATCAATTAC
651 ATATTTATGG TCATGGCGAT GTCAAAGCTT ACTTTGTTGA TGGCTATGAT
701 ATTGGGAAAT TCACAATGAA GGTCATTGAT GATGAAAGAA CAATCAACAA
751 AAATGTTCAT TTTCGACCTT CTAACAATTG TTATAGCATG AATGAGCTTG
801 CTTCTTTGTG GGAAAACAAA ATTGCACGAA AAATTCCTAG AGTGATCGTC
851 TCTGAAGACG ATCTTCTAGC AATAGCCGCA GAAAATTGCA TACCGGAAAG
901 TGTCGTGGCA CCAATCACTC ATGATATATT CATCAATGGA TGTCAAGTTA
951 ACTTCAAGAT AGATGGAATT CATGATGTTG AAATTGGCAC TCTATATCCT

1001 GGTGAATCGG TAAGAAGTTT GGAGGAATGC TATGAGAAAT TTGTTGTCAT 
1051 GGCGGCTGAC AAGATTCATA AAGAAGAAAC TGGAGTTACC GCAGGTGGGG 
1101 GCGGCACAAC GGCTATGGTA GAGCCGGTGC CAATCACAGC TTCCTGTTGA 
1151 AAAGGTTCAC CTGAGGTGGA TATTCTTTTG AGTCATAAGA CATGTTGATT 
1201 GTTGATGTTG TTTTCAAGAA TGTTTCATCA TTTCATGTGT TTTATTAATC 
1251 CTAAGTACAA ATAATTGCTG TCTACGTACG TTCTTAGTTG CAAAAATTCT 
1301 TGTTATTCTC TATCAAAAAA AAAAAAAAAA AAAAAAAAAA AAAGTACTCT
1351 GCGTTGTTAC CACTGCTTAA TCACTAGTGA ATTC
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1 MAPAATSSPT TPTTTKGRVL IVGGTGFIGK FVTEASLSTT HPTYLLVRPG

51 PLLSSKAATI KAFQEKGAIV IYGRVNNKEF MEMILKKYEI NVVISAIGGS

101 DGLLEQLTLV EAMKSINTIK RFLPSEFGHD VDRADPVEPG LTMYKQKRLV

151 RRVIEESGIP YTYICCNSIA SWPYYDNCHP SQLPPPLDQL HIYGHGDVKA

201 YFVDGYDIGK FTMKVIDDER TINKNVHFRP SNNCYSMNEL ASLWENKIAR

251 KIPRVIVSED DLLAIAAENC IPESVVAPIT HDIFINGCQV NFKIDGIHDV

301 EIGTLYPGES VRSLEECYEK FVVMAADKIH KEETGVTAGG GGTTAMVEPV

351 PITASC
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pPZP221 :35S 2 TrLARc sense 




pPZP221:35S 2 TrLARc anti 
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pPZP221 :ASSU::BAN:35S 2 ::CHS 



FIGURE 33 



WO 2004/090136 



PCT/AU2004/000494 







FIGURE 34 
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FIGURE 39 
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M80676490.ST25 
SEQUENCE LISTING 

<110> Agriculture Victoria Services Pty Ltd 
AgResearch Limited 

<120> Manipulation of condensed tannin biosynthesis 

<130> M80676490 : DLT: cl 

<150> 2003901797 
<151> 2003-04-14 

<150> 2003904369 
<151> 2003-08-14 

<160> 77 

<170> PatentIn version 3.2

<210> 1 
<211> 1447 
<212> DNA 

<213> Trifolium repens

<400> 1
gaattcacta gtgattaagc agtggtaaca acgcagagta cgcggggaac aaaaacaact 60
acgcatatta tatatatata tatatagtct ataattgaaa gaaactgcta aagatattat 120
taagatatgg tgagtgtagc tgaaattcgc aaggctcaga gggctgaagg ccctgcaacc 180
attttggcca ttggcactgc aaatccacca aaccgtgttg agcagagcac atatcctgat 240
ttctacttca aaattacaaa cagtgagcac aagactgagc tcaaagagaa gttccaacgc 300
atgtgtgaca aatccatgat caagagcaga tacatgtatc taacagaaga gattttgaaa 360
gaaaatccta gtctttgtga atacatggca ccttcattgg atgctaggca agacatggtg 420
gtggttgagg tacctagact tgggaaggag gctgcagtca aggccattaa agaatggggt 480
caaccaaagt caaagattac tcacttaatc ttttgcacca caagtggtgt tgacatgcct 540
ggtgctgatt accaactcac aaaactctta ggtcttcgcc catatgtgaa aaggtatatg 600
atgtaccaac aaggttgttt tgcaggaggc acggtgcttc gtttggcaaa agatttggcc 660
gagaacaaca aaggtgctcg tgtgctagtt gtttgttctg aagtcaccgc agtcacattt 720
cgcggcccca gtgatactca cttggacagt cttgttggac aagcattgtt tggagatgga 780
gccgctgcac taattgttgg ttctgatcca gtgcctgaaa ttgagaaacc aatatttgag 840
atggtttgga ctgcacaaac aattgctcca gacagtgaag gtgccattga tggtcatctt 900
cgtgaagctg ggctaacatt tcatcttctt aaagatgttc ctgggattgt atcaaagaac 960
attaataaag cattggttga ggctttccaa ccattaggaa tttctgacta caactcaatc 1020
ttttggattg cacacccggg tggacctgca attcttgatc aagtagaaca aaagctagcc 1080
ttgaagcccg aaaagatgag ggccacgagg gaagttctaa gtgaatatgg aaacatgtca 1140
agcgcatgtg tattgttcat cttagatgag atgcggaaga aatcggctca aaatggactt 1200
aagacaactg gagaaggact tgattggggt gtgttgttcg gcttcggacc aggacttacc 1260
attgaaaccg ttgttcttcg tagcgtggct atataagatg tgtgattgtt tttattttaa 1320
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tgtattactt ttaatcttgc tgccttgaat ttcgatttaa gaataaataa atatatcttt 1380 
tgataaaaaa aaaaaaaaaa aaaaaaaaaa aagtactctg cgttgttacc actgcttaat 1440 
cgaattc 1447 

<210> 2 
<211> 389 
<212> PRT 

<213> Trifolium repens
<400> 2

Met Val Ser Val Ala Glu Ile Arg Lys Ala Gln Arg Ala Glu Gly Pro
1 5 10 15

Ala Thr Ile Leu Ala Ile Gly Thr Ala Asn Pro Pro Asn Arg Val Glu
20 25 30

Gln Ser Thr Tyr Pro Asp Phe Tyr Phe Lys Ile Thr Asn Ser Glu His
35 40 45

Lys Thr Glu Leu Lys Glu Lys Phe Gln Arg Met Cys Asp Lys Ser Met
50 55 60

Ile Lys Ser Arg Tyr Met Tyr Leu Thr Glu Glu Ile Leu Lys Glu Asn
65 70 75 80

Pro Ser Leu Cys Glu Tyr Met Ala Pro Ser Leu Asp Ala Arg Gln Asp
85 90 95

Met Val Val Val Glu Val Pro Arg Leu Gly Lys Glu Ala Ala Val Lys
100 105 110

Ala Ile Lys Glu Trp Gly Gln Pro Lys Ser Lys Ile Thr His Leu Ile
115 120 125

Phe Cys Thr Thr Ser Gly Val Asp Met Pro Gly Ala Asp Tyr Gln Leu
130 135 140

Thr Lys Leu Leu Gly Leu Arg Pro Tyr Val Lys Arg Tyr Met Met Tyr
145 150 155 160

Gln Gln Gly Cys Phe Ala Gly Gly Thr Val Leu Arg Leu Ala Lys Asp
165 170 175

Leu Ala Glu Asn Asn Lys Gly Ala Arg Val Leu Val Val Cys Ser Glu
180 185 190

Val Thr Ala Val Thr Phe Arg Gly Pro Ser Asp Thr His Leu Asp Ser
195 200 205

Leu Val Gly Gln Ala Leu Phe Gly Asp Gly Ala Ala Ala Leu Ile Val
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210 215 220 

Gly Ser Asp Pro Val Pro Glu Ile Glu Lys Pro Ile Phe Glu Met Val
225 230 235 240

Trp Thr Ala Gln Thr Ile Ala Pro Asp Ser Glu Gly Ala Ile Asp Gly
245 250 255

His Leu Arg Glu Ala Gly Leu Thr Phe His Leu Leu Lys Asp Val Pro
260 265 270

Gly Ile Val Ser Lys Asn Ile Asn Lys Ala Leu Val Glu Ala Phe Gln
275 280 285

Pro Leu Gly Ile Ser Asp Tyr Asn Ser Ile Phe Trp Ile Ala His Pro
290 295 300

Gly Gly Pro Ala Ile Leu Asp Gln Val Glu Gln Lys Leu Ala Leu Lys
305 310 315 320

Pro Glu Lys Met Arg Ala Thr Arg Glu Val Leu Ser Glu Tyr Gly Asn
325 330 335

Met Ser Ser Ala Cys Val Leu Phe Ile Leu Asp Glu Met Arg Lys Lys
340 345 350

Ser Ala Gln Asn Gly Leu Lys Thr Thr Gly Glu Gly Leu Asp Trp Gly
355 360 365

Val Leu Phe Gly Phe Gly Pro Gly Leu Thr Ile Glu Thr Val Val Leu
370 375 380

Arg Ser Val Ala Ile
385

<210> 3 

<211> 2394 

<212> DNA 

<213> Trifolium repens

<400> 3

gaattcgatt aagcagtggt aacaacgcag agtacgcggg gattcaatct gttgtgcata 60

aaattcactc attgcataga aaaccataca catttgatct tgcaaagaag aaatatggga 120

gacgaaggta tagtgagagg tgtcacaaag cagacaaccc ctgggaaggc tactatattg 180

gctcttggca aggcattccc tcaccaactt gtgatgcaag agtgtttagt tgatggttat 240

tttagggaca ctaattgtga caatcctgaa cttaagcaga aacttgctag actttgtaag 300

acaaccacgg taaaaacaag gtatgttgtt atgaatgagg agatactaaa gaaatatcca 360

gaacttgttg tcgaaggcgc ctcaactgta aaacaacgtt tagagatatg taatgaggca 420

gtaacacaaa tggcaattga agcttcccaa gtttgcctaa agaattgggg tagatcctta 480
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tcggacataa ctcatgtggt ttatgtttca tctagtgaag ctagattacc cggtggtgac 540 

ctatacttgt caaaaggact aggactaaac cctaaaattc aaagaaccat gctctatttc 600 

tctggatgct cgggaggcgt agccggcctt cgcgttgcga aagacgtagc tgagaacaac 660 

cctggaagta gagttttgct tgctacttcg gaaactacaa ttattggatt caagccacca 720 

agtgttgata gaccttatga tcttgttggt gtggcactct ttggagatgg tgctggtgca 780 

atgataattg gctcagaccc ggtatttgaa actgagacac cattgtttga gctgcatact 840 

tcagctcagg agtttatacc agacaccgag aagaaaattg atgggcggct gacggaggag 900 

ggcataagtt tcacactagc aagggaactt ccgcagataa tcgaagacaa tgttgaggga 960 

ttctgtaata aactaattga tgttgttggg ttggagaata aggagtacaa taagttgttt 1020 

tgggctgtgc atccaggtgg gcctgcgata ttgaatcgcg tggagaagcg gcttgagttg 1080 

tcgccgcaga agctgaatgc tagtagaaaa gctctaatgg attatggaaa tgctagcagc 1140 

aatactattg tttatgtgct ggaatatatg ctagaagagg aaaagaagat taaaaaggcg 1200 

ggtggaggag attctgaatg gggattgata cttgcttttg gacctggaat tacttttgag 1260 

gggattctag caaggaactt gtgtgcatga agtcttatac aattgtgatg catgacttat 1320 

actcttattt ctactaatta ttatattaag caaattcaga acttttaagt aatgatttaa 1380 

tgaagaatac ttatagtata ttgactttat tcactttcaa agcaagttta tgatcctaag 1440 

acatggtaga acttgagcat gtggaatagt tgtaacaaaa actctaagca aatagagact 1500 

ttatgtagta taaagcattt ccagacatga taaataatgg tacctcagaa cataaaatat 1560

atttagctat ctttcatccc caactttaca catccaccaa ggtacagaat aagcatatgt 1620 

caacacaaaa tgtactctaa gtctaacatg agtaaccaaa catgatgcct gattaagtta 1680 

aaagaaaaga aaatctgagg gcatagatct tcaatcacac cactccagag ggaaggcgta 1740 

gaacaagctg tccgccgaaa acactgcaat tcaataaata tcattaggac aacagtgcag 1800 

agtcatgcgg gaaatgtctt aagtcactgt actaaaaata taggattata ttatgaacta 1860 

tactaacctt ttcacataat agtaacagaa atcagctaag atgaatgtct ggacaatttc 1920 

tgagataaga accatgacgg ccataagcca taccccaagg caaccaataa atgtccacgg 1980 

gtatctaaca cctgttgcaa gaaatagtaa gttattagga gatgtgcggt tacgaaattc 2040 

aagctacaca acaaaaggag gccagaacaa cagcaatctt gtaaccagat gacaacaata 2100 

aaatgtaaac ttaaagagac cgaacacaca aacattgcaa ctcagatgga attgctgcca 2160 

tgtaactagt aggagatttg ggacgtcaaa tcagtatatt atgcaaatac aaggtatgac 2220 

cgccttgtct attgtagcat acaacaaacg tacagtgggt ttgtccctct caaaatggca 2280 

ggatctttac agcacaatat ttggttttgt catacttata ccataaaaaa aaaaaaaaaa 2340 

aaaaaaaaaa aaagtactct gcgttgttac cactgcttaa tcactagtga attc 2394 



<210> 4 
<211> 391 
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<212> PRT 

<213> Trifolium repens
<400> 4

Met Gly Asp Glu Gly Ile Val Arg Gly Val Thr Lys Gln Thr Thr Pro
1 5 10 15

Gly Lys Ala Thr Ile Leu Ala Leu Gly Lys Ala Phe Pro His Gln Leu
20 25 30

Val Met Gln Glu Cys Leu Val Asp Gly Tyr Phe Arg Asp Thr Asn Cys
35 40 45

Asp Asn Pro Glu Leu Lys Gln Lys Leu Ala Arg Leu Cys Lys Thr Thr
50 55 60

Thr Val Lys Thr Arg Tyr Val Val Met Asn Glu Glu Ile Leu Lys Lys
65 70 75 80

Tyr Pro Glu Leu Val Val Glu Gly Ala Ser Thr Val Lys Gln Arg Leu
85 90 95

Glu Ile Cys Asn Glu Ala Val Thr Gln Met Ala Ile Glu Ala Ser Gln
100 105 110

Val Cys Leu Lys Asn Trp Gly Arg Ser Leu Ser Asp Ile Thr His Val
115 120 125

Val Tyr Val Ser Ser Ser Glu Ala Arg Leu Pro Gly Gly Asp Leu Tyr
130 135 140

Leu Ser Lys Gly Leu Gly Leu Asn Pro Lys Ile Gln Arg Thr Met Leu
145 150 155 160

Tyr Phe Ser Gly Cys Ser Gly Gly Val Ala Gly Leu Arg Val Ala Lys
165 170 175

Asp Val Ala Glu Asn Asn Pro Gly Ser Arg Val Leu Leu Ala Thr Ser
180 185 190

Glu Thr Thr Ile Ile Gly Phe Lys Pro Pro Ser Val Asp Arg Pro Tyr
195 200 205

Asp Leu Val Gly Val Ala Leu Phe Gly Asp Gly Ala Gly Ala Met Ile
210 215 220

Ile Gly Ser Asp Pro Val Phe Glu Thr Glu Thr Pro Leu Phe Glu Leu
225 230 235 240

His Thr Ser Ala Gln Glu Phe Ile Pro Asp Thr Glu Lys Lys Ile Asp
245 250 255
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Gly Arg Leu Thr Glu Glu Gly Ile Ser Phe Thr Leu Ala Arg Glu Leu
260 265 270

Pro Gln Ile Ile Glu Asp Asn Val Glu Gly Phe Cys Asn Lys Leu Ile
275 280 285

Asp Val Val Gly Leu Glu Asn Lys Glu Tyr Asn Lys Leu Phe Trp Ala
290 295 300

Val His Pro Gly Gly Pro Ala Ile Leu Asn Arg Val Glu Lys Arg Leu
305 310 315 320

Glu Leu Ser Pro Gln Lys Leu Asn Ala Ser Arg Lys Ala Leu Met Asp
325 330 335

Tyr Gly Asn Ala Ser Ser Asn Thr Ile Val Tyr Val Leu Glu Tyr Met
340 345 350

Leu Glu Glu Glu Lys Lys Ile Lys Lys Ala Gly Gly Gly Asp Ser Glu
355 360 365

Trp Gly Leu Ile Leu Ala Phe Gly Pro Gly Ile Thr Phe Glu Gly Ile
370 375 380

Leu Ala Arg Asn Leu Cys Ala
385 390

<210> 5 

<211> 1653 

<212> DNA 

<213> Trifolium repens

<400> 5

gaattcgatt aagcagtggt aacaacgcag agtacgcggg actaagcctt gattcattgt 60

ttgtttccat aacacaagaa ctagtgtttg cttgaatctt aagaaaaaat gcctcaaggt 120

gatttgaatg gaagttcctc ggtgaatgga gcacgtgcta gacgtgctcc tactcaggga 180

aaggcaacga tacttgcatt aggaaaggct ttccccgccc aggtcctccc tcaagagtgc 240

ttggtggaag gattcattcg cgacactaag tgtgacgata cttatattaa ggagaaattg 300

gagcgtcttt gcaaaaacac aactgtgaaa acaagataca cagtaatgtc aaaggagatc 360

ttagacaact atccagagct agccatagat ggaacaccaa caataaggca aaagcttgaa 420

atagcaaatc cagcagtagt tgaaatggca acaagagcaa gcaaagattg catcaaagaa 480

tggggaaggt cacctcaaga tatcacacac atagtctatg tttcctcgag cgaaattcgt 540

ctacccggtg gtgaccttta tcttgcaaat gaactcggct taaacagcga tgttaatcgc 600

gtaatgctct atttcctcgg ttgctacggc ggtgtcactg gcttacgtgt cgccaaagac 660

atcgccgaaa ataaccctgg tagtagggtg ttactcacaa catccgagac cactattctc 720

ggttttcgac caccgagtaa agctagacct tatgacctcg ttggcgctgc acttttcggt 780
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gatggcgccg ctgctgcaat aattggaaca gaccctatat tgaatcaaga atcacctttc 840

atggaattga accatgcagt ccaaaaattc ttgcctgata cacaaaatgt gattgatggt 900

agaatcactg aagagggtat taattttaag cttggaagag accttcctca aaaaattgaa 960

gacaatattg aagaattttg caagaaaatt atggctaaaa gtgatgttaa ggaatttaat 1020 

gacttatttt gggctgttca tcctggtggg ccagctatac tcaataagct agaaaatata 1080 

ctcaaattga aaagtgataa attggattgt agtaggaagg cattaatgga ttatggaaat 1140 

gttagtagca atactatatt ctatgtgatg gagtatatga gagattattt gaaggaagat 1200 

ggaagtgaag aatggggatt aggattggct tttggaccag ggattacttt tgaaggggtt 1260 

ctcctccgta gcctttaatc ttgaaataat aattcatatg aaattacttg tcttaagatt 1320 

gtgataggaa gatgaatatg tattggatta atattgatat ggtgttattt taagttgatt 1380 

ttaaaaaaag tttattaata aagtatgatg taacaattgt tgtttgaatg ttaaaaggga 1440 

agtatactat tttaagttct tgaccatact gattttttct ttacacattt tcatatctaa 1500 

aattgttcta tgatatcttc attgttgata ctgtaataat ataatatcta atttggctgg 1560 

caaaatgaaa gatttttcac cgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aagtactctg 1620 

cgttgttacc actgcttaat cactagtgaa ttc 1653 

<210> 6 
<211> 389 
<212> PRT 

<213> Trifolium repens
<400> 6

Met Pro Gln Gly Asp Leu Asn Gly Ser Ser Ser Val Asn Gly Ala Arg
1 5 10 15

Ala Arg Arg Ala Pro Thr Gln Gly Lys Ala Thr Ile Leu Ala Leu Gly
20 25 30

Lys Ala Phe Pro Ala Gln Val Leu Pro Gln Glu Cys Leu Val Glu Gly
35 40 45

Phe Ile Arg Asp Thr Lys Cys Asp Asp Thr Tyr Ile Lys Glu Lys Leu
50 55 60

Glu Arg Leu Cys Lys Asn Thr Thr Val Lys Thr Arg Tyr Thr Val Met
65 70 75 80

Ser Lys Glu Ile Leu Asp Asn Tyr Pro Glu Leu Ala Ile Asp Gly Thr
85 90 95

Pro Thr Ile Arg Gln Lys Leu Glu Ile Ala Asn Pro Ala Val Val Glu
100 105 110

Met Ala Thr Arg Ala Ser Lys Asp Cys Ile Lys Glu Trp Gly Arg Ser
115 120 125

Pro Gln Asp Ile Thr His Ile Val Tyr Val Ser Ser Ser Glu Ile Arg
130 135 140

Leu Pro Gly Gly Asp Leu Tyr Leu Ala Asn Glu Leu Gly Leu Asn Ser
145 150 155 160

Asp Val Asn Arg Val Met Leu Tyr Phe Leu Gly Cys Tyr Gly Gly Val
165 170 175

Thr Gly Leu Arg Val Ala Lys Asp Ile Ala Glu Asn Asn Pro Gly Ser
180 185 190

Arg Val Leu Leu Thr Thr Ser Glu Thr Thr Ile Leu Gly Phe Arg Pro
195 200 205

Pro Ser Lys Ala Arg Pro Tyr Asp Leu Val Gly Ala Ala Leu Phe Gly
210 215 220

Asp Gly Ala Ala Ala Ala Ile Ile Gly Thr Asp Pro Ile Leu Asn Gln
225 230 235 240

Glu Ser Pro Phe Met Glu Leu Asn His Ala Val Gln Lys Phe Leu Pro
245 250 255

Asp Thr Gln Asn Val Ile Asp Gly Arg Ile Thr Glu Glu Gly Ile Asn
260 265 270

Phe Lys Leu Gly Arg Asp Leu Pro Gln Lys Ile Glu Asp Asn Ile Glu
275 280 285

Glu Phe Cys Lys Lys Ile Met Ala Lys Ser Asp Val Lys Glu Phe Asn
290 295 300

Asp Leu Phe Trp Ala Val His Pro Gly Gly Pro Ala Ile Leu Asn Lys
305 310 315 320

Leu Glu Asn Ile Leu Lys Leu Lys Ser Asp Lys Leu Asp Cys Ser Arg
325 330 335

Lys Ala Leu Met Asp Tyr Gly Asn Val Ser Ser Asn Thr Ile Phe Tyr
340 345 350

Val Met Glu Tyr Met Arg Asp Tyr Leu Lys Glu Asp Gly Ser Glu Glu
355 360 365

Trp Gly Leu Gly Leu Ala Phe Gly Pro Gly Ile Thr Phe Glu Gly Val
370 375 380

Leu Leu Arg Ser Leu
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385 

<210> 7 
<211> 1600 
<212> DNA 

<213> Trifolium repens



<400> 7 
gaatxcac ia 


gxgaxxaagc 


a g x g y T. a. a a. 


cLLy Ldyay La 


Ly Ly yyyydd 


LtuciLLaaa t- 


60 


caacaccaxx 


aaiaaccnc 


CadaUi.LT.Ly 


L LdLL LLdLL 


ddd L L L Ld L L 


L L LLu L La La 


120 


tatcttgggt 


acaxc ling 


IiaCLLLLad 


Lddddddd Ly 


y Ly dLLy Ldy 


aananal — \~ en 
aayaya L LLy 


180 


xaacgcccaa 


cgxxcaaaxg 


gcccxgc-C-dL. 


Ld LL L Ldy L L 


tttnnraran 
L L LyyLdLdy 


LLaL LLL L LL 


240 


taactgtgtc 


acxcaagcxg 


aLXdT.LLX.yd 


L LdL LdL L.LL 


Ly Ld LLdLLd 


aranrnaara 
dLdy LyddLd 


300 


x at: g act: gat: 


cxxaaggaaa 


aaxxcddyLy 


yd Ly ty Lyd l 


anatraatna 
dy d LLdd Lyd 


Ldddy aaciLy 


360 


xxacaxgcac 


cxaacagaag 


dCXXXLXydd 


nnanaatrra 
yydydd LL La 


aatatntritn 
dd Ld Ly Ly Ly 


ad Lata Lyyu 


420 


accaxcac ia 


/*i +■ n -f- "3 n *a p 
ga LyXaaydL 


ydy aLd LdLJ L 


y y l Ly l tyaa 


y LuLLuua^C 


cay y Laaay u 


480 


agcagcaaaa 




gxgaaxgggy 


aLaaLLaaaa 


LLLaaaa LLu 


LuLa L L. L Ly L. 


540 


tzttctgcacc 


acxxccggxg 


xxgacaxgLL 


ggg a g c Lgax 


LaLLaaL L Ld 


LLdddL L L L L 


600 


aggc ttaaaa 


ccxxcxgxca 


agcgxcxcax 


gdXyXdXLdd 


Lddyy l Ly l l 


trnrtnnrnn 
LLy l Lyy Ly y 


660 


cacagxtx tx 


cgcxxagcaa 


-3 /-f «-) /— <~ "f- "f* f~1 "f" 

adydLCXXgX 


xgagddXddL 


3 3 3 3 3 n <^ 3 3 

ddddd Ly Ldd 


yay llllli|l 


720


tgtttgttct 


gaaaxxacxg 


cggxxacxxx 


xcyxgydLLd 


LLyydLdL LL 


a l l l l y a l l l 


780 


gctcgtggga 


caggcgct tit 


xxggxgaxgg 


agccgLagLd 


dXgdXXdXXy 


y xy Lyyd lll 




tgatttaacc 


gtggagcgtc 


cgaxxxxcga 


gaxxgxxxcg 


gLxgLXLdya 


L Ld L LL L LLL 




"tgattctgat 


ggcgcaattg 


axggacaxcx 


xcgxgaagxg 


gggLXCdCxx 


1- +* r~ 1- +• 1- ^ -t* 1- 
LLLdLLLdLL 


960


gaaagatgtt 


ccggggaxxa 


xxxcaaagaa 


caxxyaaddd 


dy L L Lay L Ly 


ddy L L L L Ly L 




gcctattggg 


axxaaxgaxx 


ggaacxcaax 


axxxxgggxx 


nr~3 ra"t*rra n 
yLdLdLLLdy 


y Lyy dLLy y l 


1080


tattttagac 


caggttgaag 


agaaactcca 


tcttaaagag 


gagaaactcc 


ggtccacccg 


1140 


gcatgtgctt 


agtgaatatg 


gaaatatgtc 


aagtgcatgt 


gttttattta 


ttttggatga 


1200 


aatgagaaag 


aggtctaaag 


aggaagggat 


gattacaact 


ggtgaagggt 


tggaatgggg 


1260 


tgtgttgttt 


gggtttggac 


cgggtttaac 


tgttgaaacc 


gttgtgcttc 


atagtgttcc 


1320 


ggttcagggt 


tgaatttatt 


atacatagat 


tggaaaataa 


aatttgcctg 


ccgagagatg 


1380 


tgaactaact 


ttgtaggcaa 


gctcaaatta 


aagtttgaga 


taatattgtg 


ctttagttat 


1440 


tatggtatgt 


aatgtaatgt 


ttttactttt 


ttcgaaattc 


atgtaatttg 


atatgtaaag 


1500 


taatatgttt 


gggttggaat 


ataattattt 


gttaactaaa 


aaaaaaaaaa 


aaaaaaaaaa 


1560 


aaaaagtact 


ctgcgttgtt 


accactgctt 


aatcgaattc 






1600 



<210> 8 
<211> 391 
<212> PRT 

<213> Trifolium repens
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<400> 8 

Met Val Thr Val Glu Glu Ile Arg Asn Ala Gln Arg Ser Asn Gly Pro
1 5 10 15

Ala Thr Ile Leu Ala Phe Gly Thr Ala Thr Pro Ser Asn Cys Val Thr
20 25 30

Gln Ala Asp Tyr Pro Asp Tyr Tyr Phe Arg Ile Thr Asn Ser Glu His
35 40 45

Met Thr Asp Leu Lys Glu Lys Phe Lys Arg Met Cys Asp Arg Ser Met
50 55 60

Ile Lys Lys Arg Tyr Met His Leu Thr Glu Asp Phe Leu Lys Glu Asn
65 70 75 80

Pro Asn Met Cys Glu Tyr Met Ala Pro Ser Leu Asp Val Arg Arg Asp
85 90 95

Ile Val Val Val Glu Val Pro Lys Leu Gly Lys Glu Ala Ala Lys Lys
100 105 110

Ala Ile Cys Glu Trp Gly Gln Pro Lys Ser Lys Ile Thr His Leu Val
115 120 125

Phe Cys Thr Thr Ser Gly Val Asp Met Pro Gly Ala Asp Tyr Gln Leu
130 135 140

Thr Lys Leu Leu Gly Leu Lys Pro Ser Val Lys Arg Leu Met Met Tyr
145 150 155 160

Gln Gln Gly Cys Phe Ala Gly Gly Thr Val Leu Arg Leu Ala Lys Asp
165 170 175

Leu Val Glu Asn Asn Lys Asn Ala Arg Val Leu Val Val Cys Ser Glu
180 185 190

Ile Thr Ala Val Thr Phe Arg Gly Pro Ser Asp Thr His Leu Asp Ser
195 200 205

Leu Val Gly Gln Ala Leu Phe Gly Asp Gly Ala Ala Ala Met Ile Ile
210 215 220

Gly Ala Asp Pro Asp Leu Thr Val Glu Arg Pro Ile Phe Glu Ile Val
225 230 235 240

Ser Ala Ala Gln Thr Ile Leu Pro Asp Ser Asp Gly Ala Ile Asp Gly
245 250 255

His Leu Arg Glu Val Gly Leu Thr Phe His Leu Leu Lys Asp Val Pro
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260 265 270

Gly Ile Ile Ser Lys Asn Ile Glu Lys Ser Leu Val Glu Ala Phe Ala
275 280 285

Pro Ile Gly Ile Asn Asp Trp Asn Ser Ile Phe Trp Val Ala His Pro
290 295 300

Gly Gly Pro Ala Ile Leu Asp Gln Val Glu Glu Lys Leu His Leu Lys
305 310 315 320

Glu Glu Lys Leu Arg Ser Thr Arg His Val Leu Ser Glu Tyr Gly Asn
325 330 335

Met Ser Ser Ala Cys Val Leu Phe Ile Leu Asp Glu Met Arg Lys Arg
340 345 350

Ser Lys Glu Glu Gly Met Ile Thr Thr Gly Glu Gly Leu Glu Trp Gly
355 360 365

Val Leu Phe Gly Phe Gly Pro Gly Leu Thr Val Glu Thr Val Val Leu
370 375 380

His Ser Val Pro Val Gln Gly
385 390

<210> 9 

<211> 1309 

<212> DNA 

<213> Trifolium repens

<400> 9

gaattcgatt aagcagtggt aacaacgcag agtacgcggg ataaaaactg cactagtgtg 60

tataagtttc ttggtgaaaa aagagtttgt aaattaacat catggctagt atcaaacaaa 120

ttggaaacaa gaaagcatgt gtgattggtg gcactggttt tgttgcatct atgttgatca 180

agcagttact tgaaaagggt tatgctgtta atactaccgt tagagaccca gatagcccta 240

agaaaatatc tcacctagtg gcactgcaaa gtttggggga actgaatcta tttagagcag 300

acttaacagt tgaagaagat tttgatgctc ctatagcagg atgtgaactt gtttttcaac 360

ttgctacacc tgtgaacttt gcttctcaag atcctgagaa tgacatgata aagccagcaa 420

tcaaaggtgt gttgaatgtg ttgaaagcaa ttgcaagagc aaaagaagtt aaaagagtta 480

tcttaacatc ttcggcagcc gcggtgacta taaatgaact caaagggaca ggtcatgtta 540

tggatgaaac caactggtct gatgttgaat ttctcaacac tgcaaaacca cccacttggg 600

gttatcctgc ctcaaaaatg ctagctgaaa aggctgcatg gaaatttgct gaagaaaatg 660

acattgatct aatcactgtg atacctagtt taacaactgg tccttctctc acaccagata 720

tcccatctag tgttggcttg gcaatgtctc taataacagg caatgatttt ctcataaatg 780

ctttgaaagg aatgcagttt ctgtcgggtt cgttatccat cactcatgtt gaggatattt 840
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gccgagctca tatatttctt gcagagaaag aatcagcttc tggtagatac atttgctgtg 900

ctcacaatac tagtgttccc gagcttgcaa agtttctcaa caaacgatat cctcagtata 960

aagttccaac tgaatttgat gattgcccca gcaaggcaaa gttgataatc tcttctgaaa 1020

agcttatcaa agaagggttc agtttcaagc atggtattgc cgaaactttc gaccagactg 1080

tcgagtattt taagactaag ggggcactga agaattagat tttgatattt ctaattcaat 1140

agcaaactct aagcttgtta tgtgtttgtg aagttcagag tgaaatatca aatgaataag 1200

tggagagagc acaataagag gagagcacaa taattttgga aaaaaaaaaa aaaaaaaaaa 1260

aaaaaaaagt actctgcgtt gttaccactg cttaatcact agtgaattc 1309



<210> 10 

<211> 338 

<212> PRT 

<213> Trifolium repens

<400> 10

Met Ala Ser Ile Lys Gln Ile Gly Asn Lys Lys Ala Cys Val Ile Gly
1 5 10 15

Gly Thr Gly Phe Val Ala Ser Met Leu Ile Lys Gln Leu Leu Glu Lys
20 25 30

Gly Tyr Ala Val Asn Thr Thr Val Arg Asp Pro Asp Ser Pro Lys Lys
35 40 45

Ile Ser His Leu Val Ala Leu Gln Ser Leu Gly Glu Leu Asn Leu Phe
50 55 60

Arg Ala Asp Leu Thr Val Glu Glu Asp Phe Asp Ala Pro Ile Ala Gly
65 70 75 80

Cys Glu Leu Val Phe Gln Leu Ala Thr Pro Val Asn Phe Ala Ser Gln
85 90 95

Asp Pro Glu Asn Asp Met Ile Lys Pro Ala Ile Lys Gly Val Leu Asn
100 105 110

Val Leu Lys Ala Ile Ala Arg Ala Lys Glu Val Lys Arg Val Ile Leu
115 120 125

Thr Ser Ser Ala Ala Ala Val Thr Ile Asn Glu Leu Lys Gly Thr Gly
130 135 140

His Val Met Asp Glu Thr Asn Trp Ser Asp Val Glu Phe Leu Asn Thr
145 150 155 160

Ala Lys Pro Pro Thr Trp Gly Tyr Pro Ala Ser Lys Met Leu Ala Glu
165 170 175
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Lys Ala Ala Trp Lys Phe Ala Glu Glu Asn Asp lie Asp Leu lie Thr 
180 185 190 

val lie Pro Ser Leu Thr Thr Gly Pro Ser Leu Thr Pro Asp lie Pro 
195 200 205 

Ser Ser val Gly Leu Ala Met Ser Leu He Thr Gly Asn Asp Phe Leu 
210 215 220 

lie Asn Ala Leu Lys Gly Met Gin Phe Leu ser Gly ser Leu ser lie 
225 230 235 240 

Thr His Val Glu Asp lie cys Arg Ala His lie Phe Leu Ala Glu Lys 
245 ~ 250 255 

Glu ser Ala ser Gly Arg Tyr lie Cys Cys Ala His Asn Thr ser Val 
260 265 270 

Pro Glu Leu Ala Lys Phe Leu Asn Lys Arg Tyr Pro Gin Tyr Lys Val 
275 280 285 

Pro Thr Glu Phe Asp Asp Cys Pro Ser Lys Ala Lys Leu lie lie Ser 
290 295 300 

ser Glu Lys Leu lie Lys Glu Gly Phe Ser Phe Lys His Gly lie Ala 
305 310 315 320 

Glu Thr Phe Asp Gin Thr val Glu Tyr Phe Lys Thr Lys Gly Ala Leu 

325 330 335 

Lys Asn 



<210> 11 
<211> 1409 
<212> DNA 

<213> Trifolium repens
<400> 11 

gaattcgatt aagcagtggt aacaacgcag agtacgcggg gataccaaca ttgtcacaat 60 

taactctaaa agcaaagcaa tggcaccagc agcaacatca tcaccaacca ctcctactac 120 

taccaagggt cgtgtcctaa ttgttggagg aacaggtttc attggaaaat ttgtaactga 180 

ggcaagtctt tccacaacac acccaaccta cttgttggtt cggccaggac ctcttctctc 240 

ttctaaggct gccactatta aggcattcca agagaaaggt gccattgtca tttatggtcg 300 

ggtaaataat aaggagttca tggagatgat tttgaaaaag tatgagataa atgtagtcat 360 

ttctgcaata ggaggctctg atggcttgct ggaacagctt actttggtgg aggccatgaa 420 

atctattaac accattaaga ggtttttgcc ttcggaattt ggtcacgatg tggacagagc 480 

aaatcctgtg gaacctggcc taacaatgta caaacagaaa cgtttggtta gacgtgtgat 540

cgaagaatct ggtataccat acacctacat ctgttgcaat tcgatcgcat cttggccgta 600

ctatgacaat tgtcatccat cacagcttcc tccaccgttg gatcaattac atatttatgg 660

tcatggcgat gtcaaagctt actttgttga tggctatgat attgggaaat tcacaatgaa 720

ggtcattgat gatgaaagaa caatcaacaa aaatgttcat tttcgacctt ctaacaattg 780

ttatagcatg aatgagcttg cttctttgtg ggaaaacaaa attgcacgaa aaattcctag 840

agtgatcgtc tctgaagacg atcttctagc aatagccgca gaaaattgca taccggaaag 900

tgtcgtggca ccaatcactc atgatatatt catcaatgga tgtcaagtta acttcaagat 960

agatggaatt catgatgttg aaattggcac tctatatcct ggtgaatcgg taagaagttt 1020

ggaggaatgc tatgagaaat ttgttgtcat ggcggctgac aagattcata aagaagaaac 1080

tggagttacc gcaggtgggg gcggcacaac ggctatggta gagccggtgc caatcacagc 1140

ttcctgttga aaaggttcac ctgaggtgga tattcttttg agtcataaga catgttgatt 1200

gttgatgttg ttttcaagaa tgtttcatca tttcatgtgt tttattaatc ctaagtacaa 1260

ataattgctg tctacgtacg ttcttagttg caaaaattct tgttattctc tattgaggta 1320

aaagtcttca tgtttacaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaagt actctgcgtt 1380

gttaccactg cttaatcact agtgaattc 1409



<210> 12 
<211> 356 
<212> PRT 

<213> Trifolium repens
<400> 12 

Met Ala Pro Ala Ala Thr Ser Ser Pro Thr Thr Pro Thr Thr Thr Lys
1 5 10 15

Gly Arg Val Leu Ile Val Gly Gly Thr Gly Phe Ile Gly Lys Phe Val
20 25 30

Thr Glu Ala Ser Leu Ser Thr Thr His Pro Thr Tyr Leu Leu Val Arg
35 40 45

Pro Gly Pro Leu Leu Ser Ser Lys Ala Ala Thr Ile Lys Ala Phe Gln
50 55 60

Glu Lys Gly Ala Ile Val Ile Tyr Gly Arg Val Asn Asn Lys Glu Phe
65 70 75 80

Met Glu Met Ile Leu Lys Lys Tyr Glu Ile Asn Val Val Ile Ser Ala
85 90 95

Ile Gly Gly Ser Asp Gly Leu Leu Glu Gln Leu Thr Leu Val Glu Ala
100 105 110

Met Lys Ser Ile Asn Thr Ile Lys Arg Phe Leu Pro Ser Glu Phe Gly
115 120 125

His Asp Val Asp Arg Ala Asn Pro Val Glu Pro Gly Leu Thr Met Tyr
130 135 140

Lys Gln Lys Arg Leu Val Arg Arg Val Ile Glu Glu Ser Gly Ile Pro
145 150 155 160

Tyr Thr Tyr Ile Cys Cys Asn Ser Ile Ala Ser Trp Pro Tyr Tyr Asp
165 170 175

Asn Cys His Pro Ser Gln Leu Pro Pro Pro Leu Asp Gln Leu His Ile
180 185 190

Tyr Gly His Gly Asp Val Lys Ala Tyr Phe Val Asp Gly Tyr Asp Ile
195 200 205

Gly Lys Phe Thr Met Lys Val Ile Asp Asp Glu Arg Thr Ile Asn Lys
210 215 220

Asn Val His Phe Arg Pro Ser Asn Asn Cys Tyr Ser Met Asn Glu Leu
225 230 235 240

Ala Ser Leu Trp Glu Asn Lys Ile Ala Arg Lys Ile Pro Arg Val Ile
245 250 255

Val Ser Glu Asp Asp Leu Leu Ala Ile Ala Ala Glu Asn Cys Ile Pro
260 265 270

Glu Ser Val Val Ala Pro Ile Thr His Asp Ile Phe Ile Asn Gly Cys
275 280 285

Gln Val Asn Phe Lys Ile Asp Gly Ile His Asp Val Glu Ile Gly Thr
290 295 300

Leu Tyr Pro Gly Glu Ser Val Arg Ser Leu Glu Glu Cys Tyr Glu Lys
305 310 315 320

Phe Val Val Met Ala Ala Asp Lys Ile His Lys Glu Glu Thr Gly Val
325 330 335

Thr Ala Gly Gly Gly Gly Thr Thr Ala Met Val Glu Pro Val Pro Ile
340 345 350

Thr Ala Ser Cys
355

<210> 13 
<211> 1551 
<212> DNA 

<213> Trifolium repens
<400> 13

gaattcgatt aagcagtggt aacaacgcag agtacgcggg aggatccttc cattttgcat 60
accaacattg tcacaattaa ctctaaaagc aaagcaatgg caccagcagc aacatcatca 120
ccaaccactc ctactactac caagggtcgt gtcctaattg ttggaggaac aggtttcatt 180
ggaaaatttg taactgaggc aagtctttcc acaacacacc caacctactt gttggttcgg 240
ccaggacctc ttctctcttc taaggctgcc actattaagg cattccaaga gaaaggtgcc 300
attgtcattt atggtcgggt aaataataag gagttcatgg agatgatttt gaaaaagtat 360
gagataaatg tagtcatttc tgcaatagga ggctctgatg gcttgctgga acagcttact 420
ttggtggagg ccatgaaatc tattaacacc attaagaggt ttttgccttc agaatttggt 480
cacgatgtgg acagagcaaa tcctgtggaa cctggcctaa caatgtacaa acagaaacgt 540
ttggttagac gtgtgatcga agaatctggt gtaccataca cctacatctg ttgcaattcg 600
atcgcatcct ggccgtacta tgacaattgt catccatcac agcttcctcc accgttggat 660
caattacata tttatggtca tggcgatgtc aaagcttact ttgttgatgg ctatgatatt 720
gggaaattca caatgaaggt cattgatgat gaaagaacaa tcaacaaaaa tgttcatttt 780
cgaccttcta acaattgtta tagcatgaat gagcttgctt ctttgtggga aaacaaaatt 840
gcacgaaaaa ttcctagagt gatcgtctct gaagacgatc ttctagcaat agccgcagaa 900
aactgcatac cggaaagtgt tgtggcatca atcactcatg atatattcat caatggatgt 960
caagttaact tcaaggtaga tggaattcat gatgttgaaa ttggcactct atatcctggt 1020
gaatcggtaa gaagtttgga ggaatgctat gagaaatttg ttgtcatggc ggctgacaag 1080
attcataaag aagaaactgg agttaccgca ggtgggggcg gcacaacggc tatggtagag 1140
ccggtgccaa tcacagcttc ctgttgaaaa ggttcacctg aggtggatat tcttttgagt 1200
cataagacat gttgattgtt gatgttgttt tcaagaatgt ttcatcattt catgtgtttt 1260
attaatccta agtacaaata attgctgtct acgtacgttc ttagttgcga aaattcttgt 1320
tattctctat tggggtaaaa gtcttcatgt ttattgtagt tgtgttggtt tttcatatat 1380
gctatttgca ataatgattt ttgtgaagca cttgtggtgt atttacttac tactgaaaat 1440
aatggttaca caaaatatat aaaaaaataa aaataagcaa aaaaaaaaaa aaaaaaaaaa 1500
aaaaaaaaaa gtactctgcg ttgttaccac tgcttaatca ctagtgaatt c 1551



<210> 14 
<211> 356 
<212> PRT

<213> Trifolium repens
<400> 14

Met Ala Pro Ala Ala Thr Ser Ser Pro Thr Thr Pro Thr Thr Thr Lys
1 5 10 15

Gly Arg Val Leu Ile Val Gly Gly Thr Gly Phe Ile Gly Lys Phe Val
20 25 30

Thr Glu Ala Ser Leu Ser Thr Thr His Pro Thr Tyr Leu Leu Val Arg
35 40 45

Pro Gly Pro Leu Leu Ser Ser Lys Ala Ala Thr Ile Lys Ala Phe Gln
50 55 60

Glu Lys Gly Ala Ile Val Ile Tyr Gly Arg Val Asn Asn Lys Glu Phe
65 70 75 80

Met Glu Met Ile Leu Lys Lys Tyr Glu Ile Asn Val Val Ile Ser Ala
85 90 95

Ile Gly Gly Ser Asp Gly Leu Leu Glu Gln Leu Thr Leu Val Glu Ala
100 105 110

Met Lys Ser Ile Asn Thr Ile Lys Arg Phe Leu Pro Ser Glu Phe Gly
115 120 125

His Asp Val Asp Arg Ala Asn Pro Val Glu Pro Gly Leu Thr Met Tyr
130 135 140

Lys Gln Lys Arg Leu Val Arg Arg Val Ile Glu Glu Ser Gly Val Pro
145 150 155 160

Tyr Thr Tyr Ile Cys Cys Asn Ser Ile Ala Ser Trp Pro Tyr Tyr Asp
165 170 175

Asn Cys His Pro Ser Gln Leu Pro Pro Pro Leu Asp Gln Leu His Ile
180 185 190

Tyr Gly His Gly Asp Val Lys Ala Tyr Phe Val Asp Gly Tyr Asp Ile
195 200 205

Gly Lys Phe Thr Met Lys Val Ile Asp Asp Glu Arg Thr Ile Asn Lys
210 215 220

Asn Val His Phe Arg Pro Ser Asn Asn Cys Tyr Ser Met Asn Glu Leu
225 230 235 240

Ala Ser Leu Trp Glu Asn Lys Ile Ala Arg Lys Ile Pro Arg Val Ile
245 250 255

Val Ser Glu Asp Asp Leu Leu Ala Ile Ala Ala Glu Asn Cys Ile Pro
260 265 270

Glu Ser Val Val Ala Ser Ile Thr His Asp Ile Phe Ile Asn Gly Cys
275 280 285

Gln Val Asn Phe Lys Val Asp Gly Ile His Asp Val Glu Ile Gly Thr
290 295 300

Leu Tyr Pro Gly Glu Ser Val Arg Ser Leu Glu Glu Cys Tyr Glu Lys
305 310 315 320

Phe Val Val Met Ala Ala Asp Lys Ile His Lys Glu Glu Thr Gly Val
325 330 335

Thr Ala Gly Gly Gly Gly Thr Thr Ala Met Val Glu Pro Val Pro Ile
340 345 350

Thr Ala Ser Cys
355



<210> 15 
<211> 1384 
<212> DNA 

<213> Trifolium repens
<400> 15

gaattcgatt aagcagtggt aacaacgcag agtacgcggg gataccaaca ttgtcacaat 60
taactctaaa agtaaagcaa tggcaccagc agcaacatca tcaccaacca ctcccactac 120
taccaagggt cgtgtcctaa ttgttggagg aacaggtttc attggaaaat ttgtaactga 180
ggcaagtctt tccacaacac acccaaccta cttgttggtt cggccaggac ctcttctctc 240
ttctaaggct gccactatta aggcattcca agagaaaggt gccattgtca tttatggtcg 300
ggtaaataat aaggagttca tggagatgat tttgaaaaag tatgagataa atgtagtcat 360
ttctgcaata ggaggctctg atggcttgct ggaacagctt actttggtgg aggccatgaa 420
atctattaac accattaaga ggtttttgcc ttcggaattt ggtcacgatg tggacagagc 480
agatcctgtg gaacctggcc taacaatgta caaacagaaa cgtttggtta gacgtgtgat 540
cgaagaatct ggtataccat acacctacat ctgttgcaat tcgatcgcat cttggccgta 600
ctatgacaat tgtcatccat cacagcttcc tccaccgttg gatcaattac atatttatgg 660
tcatggcgat gtcaaagctt actttgttga tggctatgat attgggaaat tcacaatgaa 720
ggtcattgat gatgaaagaa caatcaacaa aaatgttcat tttcgacctt ctaacaattg 780
ttatagcatg aatgagcttg cttctttgtg ggaaaacaaa attgcacgaa aaattcctag 840
agtgatcgtc tctgaagacg atcttctagc aatagccgca gaaaattgca taccggaaag 900
tgtcgtggca ccaatcactc atgatatatt catcaatgga tgtcaagtta acttcaagat 960
agatggaatt catgatgttg aaattggcac tctatatcct ggtgaatcgg taagaagttt 1020
ggaggaatgc tatgagaaat ttgttgtcat ggcggctgac aagattcata aagaagaaac 1080
tggagttacc gcaggtgggg gcggcacaac ggctatggta gagccggtgc caatcacagc 1140
ttcctgttga aaaggttcac ctgaggtgga tattcttttg agtcataaga catgttgatt 1200
gttgatgttg ttttcaagaa tgtttcatca tttcatgtgt tttattaatc ctaagtacaa 1260
ataattgctg tctacgtacg ttcttagttg caaaaattct tgttattctc tatcaaaaaa 1320
aaaaaaaaaa aaaaaaaaaa aaagtactct gcgttgttac cactgcttaa tcactagtga 1380
attc 1384

<210> 16 
<211> 356 
<212> PRT 

<213> Trifolium repens
<400> 16 

Met Ala Pro Ala Ala Thr Ser Ser Pro Thr Thr Pro Thr Thr Thr Lys
1 5 10 15

Gly Arg Val Leu Ile Val Gly Gly Thr Gly Phe Ile Gly Lys Phe Val
20 25 30

Thr Glu Ala Ser Leu Ser Thr Thr His Pro Thr Tyr Leu Leu Val Arg
35 40 45

Pro Gly Pro Leu Leu Ser Ser Lys Ala Ala Thr Ile Lys Ala Phe Gln
50 55 60

Glu Lys Gly Ala Ile Val Ile Tyr Gly Arg Val Asn Asn Lys Glu Phe
65 70 75 80

Met Glu Met Ile Leu Lys Lys Tyr Glu Ile Asn Val Val Ile Ser Ala
85 90 95

Ile Gly Gly Ser Asp Gly Leu Leu Glu Gln Leu Thr Leu Val Glu Ala
100 105 110

Met Lys Ser Ile Asn Thr Ile Lys Arg Phe Leu Pro Ser Glu Phe Gly
115 120 125

His Asp Val Asp Arg Ala Asp Pro Val Glu Pro Gly Leu Thr Met Tyr
130 135 140

Lys Gln Lys Arg Leu Val Arg Arg Val Ile Glu Glu Ser Gly Ile Pro
145 150 155 160

Tyr Thr Tyr Ile Cys Cys Asn Ser Ile Ala Ser Trp Pro Tyr Tyr Asp
165 170 175

Asn Cys His Pro Ser Gln Leu Pro Pro Pro Leu Asp Gln Leu His Ile
180 185 190

Tyr Gly His Gly Asp Val Lys Ala Tyr Phe Val Asp Gly Tyr Asp Ile
195 200 205

Gly Lys Phe Thr Met Lys Val Ile Asp Asp Glu Arg Thr Ile Asn Lys
210 215 220

Asn Val His Phe Arg Pro Ser Asn Asn Cys Tyr Ser Met Asn Glu Leu
225 230 235 240

Ala Ser Leu Trp Glu Asn Lys Ile Ala Arg Lys Ile Pro Arg Val Ile
245 250 255

Val Ser Glu Asp Asp Leu Leu Ala Ile Ala Ala Glu Asn Cys Ile Pro
260 265 270

Glu Ser Val Val Ala Pro Ile Thr His Asp Ile Phe Ile Asn Gly Cys
275 280 285

Gln Val Asn Phe Lys Ile Asp Gly Ile His Asp Val Glu Ile Gly Thr
290 295 300

Leu Tyr Pro Gly Glu Ser Val Arg Ser Leu Glu Glu Cys Tyr Glu Lys
305 310 315 320

Phe Val Val Met Ala Ala Asp Lys Ile His Lys Glu Glu Thr Gly Val
325 330 335

Thr Ala Gly Gly Gly Gly Thr Thr Ala Met Val Glu Pro Val Pro Ile
340 345 350

Thr Ala Ser Cys
355

<210> 17 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 
<400> 17 

aggaggctgc agtcaagg 18 

<210> 18 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 
<400> 18 

tgcctgaaat tgagaaacc 19 

<210> 19 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 
<400> 19 

aaagctagcc ttgaagcc 18

<210> 20
<211> 19
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 20

tcggacataa ctcatgtgg 19

<210> 21
<211> 18
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 21

ttgggttgga gaataagg 18

<210> 22
<211> 18
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 22

tggacattta ttggttgc 18

<210> 23
<211> 18
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 23

tatcatgtct ggaaatgc 18

<210> 24
<211> 19
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 24

agattgcatc aaagaatgg 19

<210> 25
<211> 17
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 25

ggtccaaaag ccaatcc 17



<210> 26 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 26 

taagacgaga catagtgg 18 



<210> 27 

<211> 18 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 27 

tattcactaa gcacatgc 18 



<210> 28 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 28 

tcatttctgc aataggagg 19 



<210> 29
<211> 18
<212> DNA
<213> Artificial
<220>
<223> primer sequence
<400> 29

atccacctca ggtgaacc 18

<210> 30
<211> 18
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 30

aataggaggc tctgatgg 18

<210> 31
<211> 18
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 31

atccacctca ggtgaacc 18

<210> 32
<211> 17
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 32

aggctctgat ggcttgc 17

<210> 33
<211> 18
<212> DNA
<213> Artificial
<220>
<223> primer sequence
<400> 33

atccacctca ggtgaacc 18



<210> 34
<211> 30
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 34

gaattctaga agatatggtg agtgtagctg 30

<210> 35
<211> 30
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 35

gaattctaga atcacacatc ttatatagcc 30

<210> 36
<211> 55
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 36

ggggacaagt ttgtacaaaa aagcaggctt ctagaagata tggtgagtgt agctg 55

<210> 37
<211> 55
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 37

ggggaccact ttgtacaaga aagctgggtt ctagaatcac acatcttata tagcc 55

<210> 38
<211> 33
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 38

gaattctaga agaagaaata tgggagacga agg 33

<210> 39
<211> 33
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 39

gaattctaga aagacttcat gcacacaagt tcc 33

<210> 40
<211> 34
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 40

gaattctaga tgattcattg tttgtttcca taac 34

<210> 41
<211> 31
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 41

gaattctaga acatattcat cttcctatca c 31

<210> 42
<211> 31
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 42

gaattctaga tccaaattct cgtacctcac c 31

<210> 43
<211> 31
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 43

gaattctaga tagttcacat ctctcggcag g 31



<210> 44
<211> 37
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 44

ggatcctcta gagcactagt gtgtataagt ttcttgg 37

<210> 45
<211> 35
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 45

ggatcctcta gaccccctta gtcttaaaat actcg 35

<210> 46
<211> 52
<212> DNA
<213> Artificial
<220>
<223> primer sequence
<400> 46

ggggacaagt ttgtacaaaa aagcaggctc tagaaagcaa agcaatggca cc 52

<210> 47
<211> 51
<212> DNA
<213> Artificial
<220>
<223> primer sequence
<400> 47

ggggaccact ttgtacaaga aagctgggtc tagatccacc tcaggtgaac c 51

<210> 48
<211> 53
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 48

ggggacaagt ttgtacaaaa aagcaggctc tagaaagcaa tggcaccagc agc 53



<210> 49 

<211> 51 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 49 

ggggaccact ttgtacaaga aagctgggtc tagatccacc tcaggtgaac c 51 



<210> 50

<211> 52 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 50 

ggggacaagt ttgtacaaaa aagcaggctc tagataaagc aatggcacca gc 52 



<210> 51 

<211> 51 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 51 

ggggaccact ttgtacaaga aagctgggtc tagatccacc tcaggtgaac c 51 



<210> 52 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 52 

ccaccatgtt tgaaatttat tatgtgtttt tttccg 36 



<210> 53 

<211> 35 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 53 
taatcccggg taagggcagc ccatacaaat gaagc



<210> 54 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 54 

ataataaccg gttgatcatg agcggagaat taaggg 



<210> 55 

<211> 36 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 55 

ataatagcgg ccgctagtaa catagatgac accgcg 



<210> 56 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 56 

aatagcggcc gcgatttagt actggatttt gg 



<210> 57 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 57 

aataaccggt acccacgaag gagcatcgtg g 



<210> 58 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 58 

ataataaccg gtgcccgggg atctcctttg cc 



<210> 59
<211> 36
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 59

ataatagcgg ccgcatgcat gttgtcaatc aattgg 36

<210> 60
<211> 34
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 60

taataccggt aaatttatta tgrgtttttt tccg 34

<210> 61
<211> 37
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 61

taatgcggcc gctaagggca gcccatacaa atgaagc 37

<210> 62
<211> 23
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 62

catttcattt ggagaggaca cgc 23

<210> 63
<211> 21
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 63

aacacggttt ggtggatttg c 21

<210> 64
<211> 22
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 64

ttggagagga cacgctgaaa tc 22

<210> 65
<211> 21
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 65

acaagttggt gagggaatgc c 21

<210> 66
<211> 23
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 66

catttcattt ggagaggaca cgc 23

<210> 67
<211> 22
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 67

tcgttgcctt tccctgagta gg 22

<210> 68
<211> 21
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 68

tcatttggag aggacacgct g 21

<210> 69
<211> 24
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 69

cggtcaccat ttttttgttg gagg 24

<210> 70
<211> 22
<212> DNA
<213> Artificial
<220>
<223> Primer sequence
<400> 70

ttggagagga cacgctgaaa tc 22

<210> 71 

<211> 19 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 71 

caacaaaacc agtgccacc 19 

<210> 72 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 72 

atgacgcaca atcccactat cc 22 

<210> 73 

<211> 24 

<212> DNA 

<213> Artificial 
<220> 

<223> Primer sequence 

<400> 73 

agccttagaa gagagaagag gtcc 24 

<210> 74 

<211> 22 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence

<400> 74 

atgacgcaca atcccactat cc 22 

<210> 75 

<211> 24 

<212> DNA 

<213> Artificial 

<220> 

<223> Primer sequence 

<400> 75 

agccttagaa gagagaagag gtcc 24 

<210> 76 

<211> 22 

<212> DNA 

<213> Artificial 
<220>

<223> Primer sequence
<400> 76

atgacgcaca atcccactat cc 22

<210> 77

<211> 24

<212> DNA

<213> Artificial

<220>

<223> Primer sequence

<400> 77

agccttagaa gagagaagag gtcc 24
International application No.
A. CLASSIFICATION OF SUBJECT MATTER
Int.Cl. 7: C12N 15/29 

According to International Patent Classification (IPC) or to both national classification and IPC 

B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
SEE ELECTRONIC DATABASES BELOW 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 
SEE ELECTRONIC DATABASES BELOW 

Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
DGENE, GenBank, EMBL, SwissProt, PIR, PubMed: SEQ ID 2, 4, 6, 8, 10, 12, 14, 16; Keywords: chalcone 
synthase, leucoanthocyanidin reductase, lcr, ban, dihydroflavonol 4-reductase, clover, trifolium. 



C. DOCUMENTS CONSIDERED TO BE RELEVANT



Category* 



X 



X 



X 



Citation of document, with indication, where appropriate, of the relevant passages 



WO, A, 2003031622 (AGRICULTURE VICTORIA SERVICES PTY LTD ET AL.) 17 
April 2003 (SEQ ID 2 has 100% identity with Figure 143, SEQ ID 4 has 96% identity 
with Fig. 148, SEQ ID 6 has 93% identity with Figure 158, SEQ ID 8 has 94% identity 
with Figure 163, SEQ ID 10 has 100% identity with Figure 118) 

WO, A, 2002057418 (THE SALK INSTITUTE FOR BIOLOGICAL STUDIES) 25 
July 2002 (SEQ ID 2 has 92% identity with SEQ ID 1 on page 13) 

SWISS-PROT database Accession Number P51088, Chalcone synthase 6. Howles, P. 
A. et al. 1 October 1996 (98% identity with SEQ ID 2) 

SWISS-PROT database Accession Number P51083, Chalcone synthase 1. Arioli, T., et 
al. 1 October 1996 (97% identity with SEQ ID 2) 



Relevant to claim No. 

1-3, 6-8, 10-17, 19-23, 25-29 

1-3, 6-8, 10-17, 19-23, 25-29 

1-3, 6-8, 10-17, 19-23, 25-29 


[X] Further documents are listed in the continuation of Box C 

[X] See patent family annex 


* Special categories of cited documents: 

"A" document defining the general state of the art which is not considered to be of particular relevance 

"E" earlier application or patent but published on or after the international filing date 

"L" document which may throw doubts on priority claim(s) or which is cited to establish the publication date of another citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibition or other means 

"P" document published prior to the international filing date but later than the priority date claimed 

"T" later document published after the international filing date or priority date and not in conflict with the application but cited to understand the principle or theory underlying the invention 

"X" document of particular relevance; the claimed invention cannot be considered novel or cannot be considered to involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention cannot be considered to involve an inventive step when the document is combined with one or more other such documents, such combination being obvious to a person skilled in the art 

"&" document member of the same patent family 


Date of the actual completion of the international search 
28 June 2004 


Date of mailing of the international search report 

8 JUL 2004 


Name and mailing address of the ISA/AU 

AUSTRALIAN PATENT OFFICE 
PO BOX 200, WODEN ACT 2606, AUSTRALIA 
E-mail address: pct@ipaustralia.gov.au 
Facsimile No. (02) 6285 3929 


Authorized officer 

ALISTAIR BESTOW 

Telephone No : (02) 6283 2450 
GenPept database Accession Number CAA10131 Chalcone synthase, Dopico, B., et al. 
11 November 1998 (97% identity with SEQ ID 2) 

SWISS-PROT database Accession Number Q9SEV0 Leucoanthocyanidin reductase, 
Devic, M., et al. 28 February 2003 (61% identity with SEQ ID 10) 

GenPept database Accession Number AAF23859 11 January 2000 (61% identity with 
SEQ ID 10) 

JEZ, J. M. et al. Structure and Mechanism of Chalcone Synthase-like Polyketide 
Synthases, Journal of Industrial Microbiology & Biotechnology (2001) 27:393-8 

ARIOLI, T., et al. In Trifolium subterraneum, chalcone synthase is encoded by a 
multigene family. Gene, 138:79-86 (1994) (SEQ ID 2 has 97% identity to seq. of Fig. 3) 

CHARRIER, B., et al. Molecular characterization and expression of alfalfa (Medicago 
sativa L.) flavanone-3-hydroxylase and dihydroflavonol-4-reductase encoding genes. 
Plant Molecular Biology 1995 Nov;29(4):773-86 (See Figure 3) 

WO A1 2002066625 (COMMONWEALTH SCIENTIFIC AND INDUSTRIAL 
RESEARCH ORGANISATION ET AL.) 29 August 2002 (SEQ ID 12 and 14 and 16 
have 72% identity to SEQ ID 28) 

WO A1 2000022099 (GENESIS RESEARCH AND DEVELOPMENT 
CORPORATION LIMITED ET AL.) 20 April 2000 (SEQ ID 12 and 14 have 56% 
identity to SEQ ID 323. SEQ ID 16 has 57% identity to SEQ ID 323) 

EP A 1033405 (CERES INC.) 6 September 2000. (SEQ ID 4 has 77% identity with 
SEQ ID 66257 and SEQ ID 15839, SEQ ID 6 has 67% identity with SEQ ID 7420 and 
7419) 

WO A 2002010210 (BAYER AG) 7 February 2002. (SEQ ID 4 has 77% identity with 
SEQ ID 2451, SEQ ID 6 has 70% identity with SEQ ID 18.) 

EP A 1254960 (UNILEVER PLC) 6 November 2002. (SEQ ID 8 has 81% identity 
with Fig. 11.) 

WO A 2003040306 (GENESIS RESEARCH AND DEVELOPMENT 
CORPORATION LTD) 15 May 2003. (SEQ ID 8 has 78% identity with SEQ ID 119 
and SEQ ID 186.) 

WO A 2004020637 (INTERNATIONAL FLOWER DEVELOPMENTS PTY. LTD.) 
19 March 2004. (SEQ ID 8 has 80% identity with disclosure on p.244-6.) 

PIR database Accession Number T10231 Anther-specific protein homolog T11I11.90, 
Bevan, M., et al. 16 July 1999. (77% identity with SEQ ID 4.) 
1-3,6-8,10- 
17,19-23,25- 
1,2,4,6,7,9- 
PIR database Accession Number T15054. Anther-specific protein - wood tobacco. 
Atanassov, LI., et al. 20 September 1999. (76% identity with SEQ ID 6.) 

EMBL database Accession Number Y14507, Nicotiana sylvestris CHSLK gene. 
Atanassov LL, et al. 11 August 1997. (70% identity with SEQ ID 6.) 

SWISS-PROT database Accession Number P51075, Chalcone synthase. Pellinen, R., 
et al. 1 October 1996. (82% identity with SEQ ID 8.) 

SWISS-PROT database Accession Number Q9FSB7, Chalcone synthase 3. Springob, 
K., et al. (82% identity with SEQ ID 8), and, SPRINGOB, K., et al. Specificities of 
functionally expressed chalcone and acridone synthases from Ruta graveolens, Eur. J. 
Biochem. 267(22): 6552-9 (2000) 

TREMBL database Accession Number BAB84112. Chalcone synthase (Vitis 
vinifera). Goto-Yamamoto, N., et al. 19 July 2002. (82% identity with SEQ ID 8.) 
Box No. I Nucleotide and/or amino acid sequence(s) (Continuation of item 1.b of the first sheet) 

1. With regard to any nucleotide and/or amino acid sequence disclosed in the international application and necessary to the 
claimed invention, the international search was carried out on the basis of: 

a. type of material 

[X] a sequence listing 

[ ] table(s) related to the sequence listing 

b. format of material 

[ ] in written format 

[X] in computer readable form 

c. time of filing/furnishing 

[X] contained in the international application as filed 

[ ] filed together with the international application in computer readable form 
[ ] furnished subsequently to this Authority for the purposes of search 

2. [ ] In addition, in the case that more than one version or copy of a sequence listing and/or table(s) relating thereto has been 
filed or furnished, the required statements that the information in the subsequent or additional copies is identical to that in 
the application as filed or does not go beyond the application as filed, as appropriate, were furnished. 

3. Additional comments: 
Box No. II Observations where certain claims were found unsearchable (Continuation of item 2 of first sheet) 

This international search report has not been established in respect of certain claims under Article 17(2)(a) for the following 
reasons: 

1. [ ] Claims Nos.: 
because they relate to subject matter not required to be searched by this Authority, namely: 

2. [ ] Claims Nos.: 
because they relate to parts of the international application that do not comply with the prescribed requirements to such 
an extent that no meaningful international search can be carried out, specifically: 

3. [ ] Claims Nos.: 
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a) 

Box No. III Observations where unity of invention is lacking (Continuation of item 3 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 

The Applicant has claimed more than one invention. Rule 13.1 of the PCT states the principle that an International 
Application should relate to only one invention or, if there is more than one invention, that the inclusion of those 
inventions in one International Application is only permitted if all inventions are so linked as to form a single general 
inventive concept. 

(continued on Extra Sheet) 

As all required additional search fees were timely paid by the applicant, this international search report covers all 
searchable claims. 

As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite 
payment of any additional fee. 

As only some of the required additional search fees were timely paid by the applicant, this international search report 
covers only those claims for which fees were paid, specifically claims Nos. : 



4. [ ] No required additional search fees were timely paid by the applicant. Consequently, this international search report is
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 



Remark on Protest [ ] The additional search fees were accompanied by the applicant's protest.

[x] No protest accompanied the payment of additional search fees.

Supplemental Box

(To be used when the space in any of Boxes I to VIII is not sufficient) 

Continuation of Box No: III

Rule 13.2 of the PCT defines the method for determining whether the requirement of unity of invention is satisfied in
respect of a group of inventions claimed in an International application. Unity of invention exists only when there is a 
technical relationship among the claimed inventions involving one or more of the same or corresponding "special 
technical features." The expression "special technical features" is defined in Rule 13.2 as meaning those technical 
features that define a contribution which each of the inventions, considered as a whole, makes over the prior art. The 
determination is made on the contents of the claims as interpreted in light of the description and drawings (if any). 

Invention 1 : Polypeptide and polynucleotide encoding chalcone synthase of the formula TrCHSa3, represented by 
SEQ ID NOS 1 and 2. This invention is recited in claims 1 - 3, 6 - 8, 10 - 17, 19 - 23, 25 - 29 (in part). 

Invention 2 : Polypeptide and polynucleotide encoding chalcone synthase of the formula TrCHSc, represented by SEQ 
ID NOS 3 and 4. This invention is recited in claims 1 - 3, 6 - 8, 10 - 17, 19 - 23, 25 - 29 (in part). 

Invention 3 : Polypeptide and polynucleotide encoding chalcone synthase of the formula TrCHSf, represented by SEQ 
ID NOS 5 and 6. This invention is recited in claims 1 - 3, 6 - 8, 10 - 17, 19 - 23, 25 - 29 (in part). 

Invention 4 : Polypeptide and polynucleotide encoding chalcone synthase of the formula TrCHSh, represented by 
SEQ ID NOS 7 and 8. This invention is recited in claims 1 - 3, 6 - 8, 10 - 17, 19 - 23, 25 - 29 (in part). 

Invention 5 : Polypeptide and polynucleotide encoding dihydroflavonol 4-reductase of the formula TrBANa,
represented by SEQ ID NOS 9 and 10. This invention is recited in claims 1, 2, 4, 6 - 7, 9, 10 - 16, 18 - 22, 24 - 30 (in
part).

Invention 6 : Polypeptide and polynucleotide encoding leucoanthocyanidine reductase of the formula TrLARa,
represented by SEQ ID NOS 11 and 12. This invention is recited in claims 1, 2, 5, 6, 8 - 15, 17 - 21, 23 - 26, 28, 29,
30 (in part).

Invention 7 : Polypeptide and polynucleotide encoding leucoanthocyanidine reductase of the formula TrLARb,
represented by SEQ ID NOS 13 and 14. This invention is recited in claims 1, 2, 5, 6, 8 - 15, 17 - 21, 23 - 26, 28, 29,
30 (in part).

Invention 8 : Polypeptide and polynucleotide encoding leucoanthocyanidine reductase of the formula TrLARc,
represented by SEQ ID NOS 15 and 16. This invention is recited in claims 1, 2, 5, 6, 8 - 15, 17 - 21, 23 - 26, 28, 29,
30 (in part).

Each of the above enzymes may be involved in flavonoid biosynthesis in plants, and more specifically the modification
of the content of condensed tannins, but this is not novel, as noted in the following abstract. Therefore this cannot be
used as a special technical feature providing unity to all of the sequences.

XIE DY et al. Science (Jan 2003) 299(5605):396-9. Role of anthocyanidin reductase, encoded by BANYULS, in plant
flavonoid biosynthesis.

Although some of the claims share a technical feature whereby they are a chalcone synthase, a dihydroflavonol
4-reductase or a leucoanthocyanidine reductase, these are not special technical features because none of these are novel
enzymes. For example, the following three journal abstracts indicate that none of these enzymes are novel, and
therefore they cannot be used as a special technical feature uniting the inventions into three groups.

NAPOLI C, et al. The Plant Cell (1990) 2:279-89. Introduction of a Chimeric Chalcone Synthase Gene into Petunia
Results in Reversible Co-suppression of Homologous Genes in trans.

JOHNSON ET, et al. Plant Journal (2001) 25(3):325-33. Alteration of a single amino acid changes the substrate
specificity of dihydroflavonol 4-reductase.

TANNER GJ, et al. Anal Biochem (March 1993) 209(2):274-7. Synthesis of 3,4-cis-[3H]leucocyanidin and enzymatic
reduction to catechin.


Furthermore, given the nature of the claims and the invention, it is appropriate to apply the Markush approach. Using 
the Markush approach to analyse the unity of inventions, although the 8 gene transcripts all have the common property 
(A) that they are involved in flavonoid biosynthesis in plants, and more specifically the modification of the content of 
condensed tannins, (B)(1) there is no common structure present in all of the genes; and (B)(2) there is no single 
recognised class or group of compounds embracing all the genes claimed. It is contrary to normal classification to 
group together such diverse genes. Thus according to Markush, it is appropriate to classify the genes in terms of the 8 
individual groups and thus these groups represent 8 different inventions. 

Searching each of the inventions could only be done by considering each of the sequences, thereby
requiring eight separate searches. As a service to the applicant, this office would, for the purposes of this search,
consider Inventions 1 and 5 together for a single search fee. Each of the other sequences will be considered a single
invention, and each attracts a search fee.

The applicant chose to pay the six extra fees, which is sufficient to search and examine all eight inventions.